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HUMAN HOMOLOG OF THE DROSOPHILA PROTEIN "FUSED" 

FIELD OF THE INVENTION 
5 The present invention relates generally to signaling molecules, specifically to signaling and 

mediator molecules in the hedgehog (Hh) cascade which are involved in cell proliferation and differentiation. 

BACKGROUND OF THE INVENTION 
Development of multicellular organisms depends, at least in part, on mechanisms which specify, 
direct or maintain positional information to pattern cells, tissues, or organs. Various secreted signaling 

10 molecules, such as members of the transforming growth factor-beta (TGF-p), Wnt. fibroblast growth factors 
and hedgehog families have been associated with patterning activity of different cells and structures in 
Drosophila as well as in vertebrates. Perrimon, Cell: 80: 5 1 7-520 ( 1 995). 

Hedgehog (Hh) was first identified as a segment-polarity gene by a genetic screen in Drosophila 
melanogaster, Nusslein-Volhard et al, Roux. Arch. Dev. Biol. 193: 267-282 (1984). that plays a wide variety 

15 of developmental functions. Perrimon, supra. Although only one Drosophila Hh gene has been identified, 
three mammalian Hh homologues have been isolated: Sonic Hh (SHh), Desert Hh (DHh) and Indian Hh 
(lHh), Echelard el al, Cell 75: 1417-30 (1993); Riddle et al, Cell 75: 1401-16 (1993). SHh is expressed at 
high level in the notochord and floor plate of developing vertebrate embryos. In vitro explant assays as well 
as ectopic expression of SHh in transgenic animals show that SHh plays a key role in neuronal tube 

20 patterning, Echelard et al., supra., Krauss et al.. Cell 75, 1431-44 (1993), Riddle et al., Cell 25: 1401-16 
(1993), Roelink et al, Cell 8 1 : 445-55 (1995). In vitro explant assays as well as ectopic expression of SHh in 
transgenic animals show that SHh plays a key role in neural tube patterning, Echelard et al. (1993), supra.; 
Ericson et al.. Cell 8L 747-56 (1995); Marti et al., Nature 375: 322-5 (1995); Roelink et al. (1995), supra; 
Hynes et al, Neuron \ 9;. 1 5-26 ( 1 997). Hh also plays a role in the development of limbs (Krauss et al.. Cell 

25 75: 1431-44 (1993); Laufer et al.. Cell 79, 993-1003 (1994)), somites (Fan and Tessier-Lavigne, Cell 79, 
1 175-86 (1994); Johnson et al. Cell 79: 1 165-73 (1994)), lungs (Bellusci et al. Develop. J24: 53-63 (1997) 
and skin (Oro et al., Science 276: 817-21 (1997). Likewise, IHh and DHh are involved in bone, gut and 
germinal cell development, Apelqvist et al., Curr. Biol. 7: 801-4 (1997); Bellusci et al, Development. 124 : 
53-63 (1997); Bitgood et al., Curr. Biol. 6: 298-304 (1996); Roberts et al.. Development 121; 3163-74 

30 (1995). SHh knockout mice further strengthened the notion that SHh is critical to many aspect of vertebrate 
development, Chiang et al, Nature 383 : 407-13 (1996). These mice show defects in midline structures such 
as the notochord and the floor plate, absence of ventral cell types in neural tube, absence of distal limb 
structures, cyclopia, and absence of the spinal column and most of the ribs. 

At the cell surface, the Hh signals is thought to be relayed by the 12 transmembrane domain protein 

35 Patched (Ptch) [Hooper and Scon, Cell 59. 75 1 -65 ( 1 989); Nakano et al. Nature 34k 508- 13 (1 989)] and 
the G-protein coupled like receptor Smoothened (Smo) [Alcedo et al.. Cell 86: 221-232 (1996); van den 
Heuvel and Ingham, Nature 382: 547-551 (1996)]. Both genetic and biochemical evidence support a 
receptor model where Ptch and Smo are pan of a multicomponent receptor complex, Chen and Struhl, Cell 
87: 553-63 ( 1 996); Marigo et al. . Nature 384: 1 76-9 ( 1 996); Stone et al., Nature 384: 1 29-34 ( 1 996). Upon 
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binding of Hh to Pich. the normal inhibitory effect of Ptch on Smo is relieved, allowing Smo to transduce the 
Hh signal across the plasma membrane. Loss of function mutations in the Ptch gene have been identified in 
patients with the basal cell nevus syndrome (BCNS), a hereditary disease characterized by multiple basal cell 
carcinomas (BCCs). Disfunctional Ptch gene mutations have also been associated with a large percentage of 
sporadic basal cell carcinoma tumors, Chidambaram et al. Cancer Research 56: 4599-601 (1996); Gailani et 
al. Nature Genet. U: 78-81 (1996); Hahn et ai. Cell 85: 841-51 (1996); Johnson et ai. Science 272: 1668- 
71 (1996); Unden et ai, Cancer Res. 56: 4562-5 (1996); Wicking et ai. Am. 1 Hum. Genet. 60: 2 1-6 ( 1997). 
Loss of Ptch function is thought to cause an uncontrolled Smo signaling in basal cell carcinoma. Similarly, 
activating Smo mutations have been identified in sporatic BCC tumors (Xie et ai, Nature 391.: 90-2 (1998)), 
emphasizing the role of Smo as the signaling subunit in the receptor complex for SHh. However, the exact 
mechanism by which Ptch controls Smo activity still has yet to be clarified and the signaling mechanisms by 
which the Hh signal is transmitted from the receptor to downstream targets also remain to be elucidated. 
Genetic epistatic analysis in Drosophila has identified several segment-polarity genes which appear to 
function as components of the Hh signal transduction pathway, Ingham, Curr. Opm. Genet. Dev. 5: 492-8 
15 (1 995): Perrimon, supra. These include a kinesin-like molecule, Costal-2 (Cos-2) [Robbins et al.. Cell 90: 
225-34 (1997); Sisson et al.. Cell 90: 235-45 (1997)], a protein designated fused [Preat et ai. Genetics L35: 
1047-62 (1993); Therond et al., Proc. Natl Acad Sci. USA 93: 4224-8 (1996)], a novel molecule with 
unknown function des.gnated Suppressor effused [Pham et al. Genetics 140: 587-98 (1995); Preat. Genet.cs 
132 ; 725-36 (1992)] and a zinc finger protein Ci. [Alexandre et al., Genes Dev. 10: 2003-13 (1996); 
20 Dominguez et al., Science 272: 1621-5 (1996); Orenic et al, Genes Dev. 4: 1053-67 (1990)]. Additional 
elements implicated in Hh signaling include the transcription factor CBP [Akimaru e, al.. Nature 386: 735- 
738 (1997)], the negative regulator slimb [Jiang and Struhl, Nature 391: 493-496 (1998)] and the SHh 
response element COUP-TF1I [Krishnan et al., ScienceTW. 1947-1950 (1997)]. 

Mutants in Cos-2 are embryonicly lethal and display a phenotype similar to Hh over expression. 
25 including duplications of the central component of each segment and expansion domain of Hh responsive 
genes. In contrast, mutant embryos for fused and Ci show a phenotype similar to Hh loss of function 
including deletion of the posterior part of each segment and replacement of a mirror-like image duplication 
of the anterior part or each segment and replacement of a mirror- 1 ike duplication of the anterior part, Busson 
et al, Roux. Arch. Dev. Biol J97: 221-230 (1988). Molecular characterizations of Ci suggested that it is a 
30 transcription factor which directly activates Hh responsive genes such as Wingless and Dpp, Alexandre et al. 
(1996) supra, Dominguez et al. (1996) supra. Likewise, molecular analysis of fused reveals that it is 
structurally related to serine threonine kinases and that both intact N-terminal kinase domain and a C- 
terminal regulatory region are required for its proper function, Preat et al. Nature 34J: 87-9 (1990); Robbins 
et al, (1997). supra; Therond et al. Proc. Natl. Acad. Sci. USA 93: 4224-8 (1996). Consistent w.th the 
35 putative opposing functions of Cos-2 and fused, fused mutations are suppressed by Cos-2 mutants and also 
by Suppressor of fused mutants, Preat el al. Genetics 135: 1047^62 (1993). However, whereas /aserf null 
mutations and N-terminal kinase domain mutations can be" fully suppressed' by Suppressor of fused 
mutations. C-terminus mutations of fused display a strong Cos-2 phenotype in a Suppressor of fused 
background. This suggests that the fused kinase domain can act as a constitutive activator of SHh signaling 
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when Suppressor of Fused is not present. Recent studies have shown that the 92 kDa Drosophila fused. Cos- 
2 and Ci are present in a microtubule associated multiprotein complex and that Hh signaling leads 10 
dissociation of this complex from microtubules, Robbins ei al. Cell 90: 225-34 ( 1 997); Sisson et al. Cell 90: 
235-45 (1997). Both fused and Cos-2 become phosphorylated in response to Hh treatment, Robbins et al. 
supra: Therond et al.. Genetics J42: 1181-98 (1996), but the kinase(s) responsible for this activiry(ies) 
remain to be characterized. To date, the only known vertebrate homologues for these components are 
members of the Gli protem fam.ly (e.g.. Gli-I, Gli-2 and Gli-3). These are zinc finger putative transcnpt.on 
factors that are structurally related to Ci. Among these, G//-1 was shown to be a candidate mediator of the 
SHh signal [Hynes et al.. Neuron 15: 35-44 (1995), Lee et al.. Development 124: 2537-52 (1997); Alexandre 
et al.. Genes Dev. 10: 2003-13 (1996)] suggesting that the mechanism of gene activanon in response to Hh 
may be conserved between fly and vertebrates. To determine whether other signaling components in the Hh 
cascade are evoiutionarily conserved and to examine the function of fused in the Hh signaling cascade on the 
biochemical level, Applicants have isolated and characterized the human fused cDN A. Tissue distribution on 
the mouse indicates that fused is expressed in SHh responsive tissues. Biochemical studies demonstrate that 
fused is a functional kinase. Functional studies provide evidence that fused is an activator of Gli and that a 
dominant negative form of fused is capable of blocking SHh signaling in Xenopus embryos. Together this 
data demonstrated that fused is directly involved in Hh sianalins. 

Applicants have identified a cDNA encoding a human fused (hfused) polypeptide and thus have 
provided for the first time a vertebrate fused molecule. 

SUMMARY OF THE 1NVRNTION 
In one embodiment, the invention provides an isolated nucleic acid molecule having at least about 
80% sequence identity to (a) a DNA molecule encoding a fused polypeptide comprising the sequence of 
amino acids 1 to 260 of Fig. 1 (SEQ ID NO. 24), or (b) the complement of the DNA molecule of (a); and 
encoding a polypeptide having fused biological activity. The sequence identity preferably is about 85%, 
more preferably about 90%, most preferably about 95%. In one aspect, the isolated nucleic acid has at least 
about 80%, preferably at least about 85%, more preferably at least about 90%. and most preferably at least 
about 95% sequence identity with a polypeptide having amino acid residues 1 to about 13 15 of Fig. I (SEQ 
ID NO. 2). Preferably, the highest degree of sequence identity occurs within the kinase domain (amino acids 
1 to about 260 (SEQ ID NO:24 as shown in Fig. 1). Especially preferred are those nucleic acid molecule 
containing a coding sequence for a lysine at amino acid position 33. In a further aspect, the isolated nucleic 
acid molecule comprises DNA encoding a human fused polypeptide having amino acid residues I to about 
260 (SEQ ID NO:24 as shown in Fig. I). In yet another aspect, the invention provides for an isolated 
nucleic acid comprising DNA having at least a 95% sequence identity to (a) a DNA molecule encod.ng the 
same mature polypeptide encoded by the cDNA in ATCC Deposit No. 209637 (designation: 
P RK5tkneo.hFused-l272). alternatively the coding sequence of clone P RK5tkneo.hFused-1272, deposited 
under accession number ATCC 209637. In a still further aspect, the invention provides for a nucleic acid 
comprising human fused encoding sequence of the cDNA in ATCC deposit No. 209637 (designation: 
P RK5tkneo.hFused-l272) or a sequence which hybridizes thereto under stringent conditions. 

In another embodiment, the invention provides a vector comprising DNA encoding a vertebrate 
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Wpo.vpeptide. A host cell comprising such a vector is also provided. By way of example, the host cells 
may be mammalian cells, (e.g. CHO cells), prokaryouc cells (e.g.. £ «*> or yeast cel.s (e.g., Saccharoses 
cerev.siae) A process for producing vertebrate fused po.ypeptides is further prov.ded and composes 
during host cells under conditions suitab.e for expression of vertebrate/,^ and recovering the same from 
5 the cell culrure. 

,„ yet another embodiment, the invention provides an .soiated vertebrate fused polypept.de. In 
part.cu.ar. the .nvention provides .soiated nat.ve sequence vertebrate fused po.ypeptide, which ,n one 
embodiment is a human fused inc.ud.ng an amino acid sequence compris.ng residues 1 to about 13.5 of 
F.eure 1 (SEQ ID NO. 2). Human and other nat.ve vertebrate/^ polypeptides with or w.thout the 
10 initiating meth.on.ne are specifically included. Alternatively, the .nvent.on prov.des a vertebrate fused 
polypeptide encoded by the nucleic acid deposited under accession number ATCC 2096,7. 

,„ yet another embod.ment, the invent.on provides ch.menc mo.ecu.es comprising a vertebrate 
fused polypeptide fused to a heterologous po.ypeptide or amino acid sequence. An example of such a 
ch.meric molecule comprises a vertebrate fused polypept.de fused to an ep.tope tag sequence or a constant 

15 region of an immunoglobulin. 

In yet another embodiment, the invention provides an expressed sequence tag (EST) compns.ng the 

nucleotide sequences identified in Fig. 2 as 2515662 (SEQ ID NO. 3). 

,n yet another embodiment, the invention prov.des for compounds and methods for develop.ng 
antagon.sts aea.nst and agon.st promoting/^ modulation of Hedgehog signaling. In parucular, an 
antagon.st of vertebrate fused which blocks, prevents, inhibits and/or neutralized the norma, funct.on.ng of 
fused in SH siena.ing pathway, inc.uding both small bioorganic molecules and ant.sense nuc.eot.de, 

In yet another embodiment, the invention provides for a.temative.y spliced var.ants of ^ fused. 
,„ still vet a further embodiment, the .nvent.on provides a method of screening or assay.ng for .dent.fy.ng 
m olecuies that modulate the /^activation of hedgehog signaling. Preferably, the molecules either prevent 
.nteraction of fused with its assoc.ative comp.exing prote.ns or prevent or inhibit dissoc.at.on of complexes. 
The assay comprises the .ncubation of a m.xture compr.s.ng fused and a substrate (e.g.. Gl, C0UP-TFI1. 
sl.mt CBP MBP) with a candidate molecule and detect.on of the ability of the cand.date molecule to 
m0 dulate fused phosphorylation of Us substrate. The screened molecules preferably are small 
candidates. In part.cu.ar, the method relates to a technique for screening for antagon.sts or agon.sts effused 

30 biological activity comprising: 

(a) exposing the fused expressing target cells in culture to a candidate compound: and 

(b ) analyzing cell lysates to asses the level and/or .dentity of phosphorylation: or 

(c) scoring phenotypic or functional changes in treated cells; 

and compare the results to control cells which were not exposed to the candidate compound. 
35 in yet another embod.ment. the method relates to a technique of diagnosing to determ.ne whether a part.cu.ar 
disorder is modulated by hedgehog signaling, comprising: 

(a) culturing test cells or tissues; 

(b) administer^ a compound which can inhibit/***/ modu.ated hedgehog signaling; and 

(c) measuring the degree of kinase anenuation on the fused substrate in cel. .ysates or 
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hedgehog mediated phenotypic effects in the test cells. 

BRIEF DESCRIPTION Q F THE DRAWING 

Figures 1A-IE show the nucleot.de (SEQ ID NO. 1) and derived amino acid sequence (SEQ ID 

NO. 2) of a native sequence of human fused polypeptide. Included are the kinase domain (residues 1 to 

about 260) (SEQ ID NO 24) and the ATP binding site at about amino acid position 33. 

Figure 2 shows the EST 2515662 (SEQ ID NO. 3) that was used in the cloning of the human full- 
length sequence. 

Figures 3A-3B show a comparison between human and Drosophila fused (SEQ ID NOS. 2 and 23. 
respectively). Gaps introduced for optimal alignment are indicated by dashes. Identical amino acids are 
boxed. The lysine residue mutated in fused-DN (dominant negative, lysine at amino acid position 33) is 
shown with a star. 

Figures 4A-4E show the sequence of DNA28494 (SEQ ID NO. 6) that was an incorrectly spliced 
variant of human fused isolated from a fetal lung library. This clone contains a potential initiation 
methionine at position 1 16 followed by an open reading frame of 1944 bp. A second open reading frame is 
present from about position 2295 to 4349. There is one nucleotide difference between clone DNA28495 
(SEQ ID NO. 4) and clone DNA28494 (SEQ ID NO. 6) located in the first ORF at position 1863 of clone 
28495 (SEQ ID NO. 4) (A vs. G) which changes the cod.ng sequence from an Gin to a Arg at position 583. 
The first open reading frame of DNA28494 (SEQ ID NO. 6) starts at residue 1 1 5 and is followed by a 630 
amino acid long open reading frame. 

Figures 5A-5E show shows sequence of DNA28494 (SEQ ID NO. 6) that was another incorrectly 
spliced variant of human fused isolated from a fetal lung library. 

Figure 6 is a western blot of the PCR product of an epitope tag of DNA28495 (SEQ ID NOS. 5 & 
21) and DNA28494 (SEQ ID NOS. 7 & 22). A specific band of 150 kDa was detected in the cell pellet of 
cells transfected with the construct corresponding to clone DNA28494 (SEQ ID NO. 6) and a specific band 
of approx.mately 100 kDa could be detected for clone DNA28495 (SEQ ID NO. 4) (Fig. 6). These bands 
were not present in the mock transfected control. The presence of the 100 kDa band suggests the two open 
reading frames of DNA28494 (SEQ ID NO. 6) can be spl.ced together to direct the synthesis of a large 
protein of 150 kDa. The absence of this band for DNA28495 (SEQ ID NO. 4) suggested that this clone 
apparently cannot be correctly spliced. 

Figure 7 is a northern blot analysis of hum^n fined (SEQ ID NO I ). Multiple human fetal and adult 
tissue northern blots were probes with a human fused cDH A probe. 

Figures 8A-8F show is a photograph showing in situ hybridization of embryonic and adult tissues 
with fused (SEQ ID NO 1). Sagittal sections of E1I.5 (Fig. 8A) and EI3.5 (Fig. 8B) mouse embryos. 
Coronal section through the spinal chord of El 1.5 (Fig. 8C) and EI3.5 (Fig. 8D) mouse embrvo. Sasirtal 
section through PI (Fig. 8E) and adult (Fig. 8F) mouse. Cp. choroid plexus; hb. hindbrain: hip. hippocampal 
formation: ht. heart; hy, hypothalamus: kd, kidney; Ig, lung; mb. midbrain: md. midgut; mnd. mandibular 
component of first branchial arch; sc, spinal cord; st, stomach: tec, midbrain tectum; vh. ventral hom of 
spinal cord: vm. ventral midbrain. Scale bars: Fig. 8A, 1.0 mm; Fig. 8B. 1.62 mm; Fig. 8C, 0.14 mm; Fig. 
8D, 0. 1 7 mm; Fig. 8E, 2.0 mm; and Fig. 8F, 3. 1 mm. 
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Fibres 9A.9C are , photograph showing in * hybr.dtza.ion showing ,h. presence .f/W 
„ R NA in hi-h levels in the adul, mouse teste, (Fig. 9A,. High magnification reveals differences ,» levels o 
:Zln within seminiferous .ubu.es CF* 90 Hybrid^ - - -* - ' — — — 
r™beto>serf gave no hybridation (Fig. 9B). 

Fi-ures I0A-IOB are a bar graph representing ,he ac,v,„o„ of Gh by M <F* <<>«• 
C3HI0TI/O ccB, were corrected wi.h a p9XGULus. p.XBenilla luctferase and A- - 
mtttams Cells were hatred 4» after rransfecuon and ,he iueiferase activity was assayed as descrtbed ,„ 
lie 7 (Fig. ,0B,: « —ion of a Oii reponer _ C3H,0T„2 ce»s were 
eo.rle.ed wi.h a p9XC,iUc reponer construe, pBtB.nilla ioeiFerase and a CMV driven 
„ v .e,or fo, bused or various nased mutants. Ce„s were harves.ed 4S hours aner transfecon - *~ 
activity was assayed as described in ,he Examples. The dara represenrs .he mean of duphca.rve 

de ' emi " S I IA-HE are a pborograph showing that/n^DN (SEQ ,D NO ,5, inhibits** signing 
.neariy^n* development. Depicted are: (Fig. ! I A) Dorsal view of ladpote stage embryos. Top.mbryo 
„ is/i-DN (SEQ ,0 NO 25, injection and booom emmyo is the eontroi, (Fig. UB, Side v.ew of 
Z TOP embryo is/WDN (SEQ ,D NO 25, injection and bottom embryo .s*e conno 

, ,c * UP) Pax-b staining of stage ,6 neurul. embryos injected with contro, DNA and/^-DN (SEQ ID 
NO 25) respective,,; (Fig. . ,E) SHH expression in the Boor pla.e of neurula stage conttoi embryo <«, o, 

„ t ;r ::;:r r 21 - ■-, — « , d no „ - ,« 

" activation "of OF,. Depicted are 2,3 ce„s transfected wi.h HA tagged yW constructs as indicated m 
Exampie ,0 and immunopreeipi.a.ed with ant.-HA anuhodies and protein A sepharose. Ptotem beads 
were subjected to ,„ vitro Kinase assay as described in Example ,0 in the presence of MB* 

prT . „ en r,Fe.rBlpTION OF™F PPFFFBRED EMBODIMENTS, 

~>S I Definitions . 

The terms "vertebrate/,,^ and -vetaebrateyW polypeptide- when used herem encompass n t.vc 
scuence vertebrate/,^ and vertebrate^ vartants ate further deFmed herein, having ,W 

from another source, or prepared by recombinant or synthetic methods. 

A -native se q n=ne. vertebmteyW comprises a polypepr.de having the same ammo acd sequence 

0 r can be produced by recombinant and* synthetic means. The term "native science — >- 
speciBcal, encompasses naturally occu„i»g treated forms of venebrateyW, natuoally occa m „g var, m 
Ls (e.,. alternatively sphced forms, and naturahy-oeeurring allelic variants of vertebrate^, Na ,v 
35 vertebrate ^ includes e.g., yW in mammals such as human, murine, bovine, porcine, enu.ne. fehne 
canine etc and preferably refers .0 human. Thus, one embod.men, of the invention, .He nattve 
human vertebrate/,*,, is a mature o, fulblengrh native human ve.ebrate/W comprising amtno acds to 
,315 of SEQ IDNO: 2 as shown in Fig. 1 with or without the initiating methionine a, pos.tton 

-VerteorateyW varian," means an active vertebrate/^ as deftned below having a, least about 
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80% amino acid sequence identity to (a) a DNA molecule encoding a vertebrate fused polypeptide, or (b) the 
complement of the DNA molecule of (a). In a particular embodiment, the vertebrate fused variant has at least 
about 80% amino acid sequence homology with the vertebrate fused having the deduced amino acid 
sequence (SEQ ID NO:2) shown in Fig. 1 for a full-length native sequence vertebrate fused. Such vertebrate 
fused variants include, without limitation, vertebrate fused polypeptides wherein one or more amino acid 
residues are added, or deleted, at the N- or C-terminus of the sequence of Fig. 1 (SEQ ID NO 2). Preferably, 
the nucleic acid or amino acid sequence identity is at ieast about 85%. more preferably at least about 90%. 
and even more preferably at least about 95%. 

"Percent (%) amino acid sequence identity" with respect to the vertebrate fused sequences identified 
herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the 
amino acid residues in the vertebrate fused sequence, after aligning the sequences and introducing gaps, if 
necessary, to achieve the maximum percent sequence identity, and not considering any conservative 
substitutions as part of the sequence identify. Alignment for purposes of determining percent amino acid 
sequence identity can be achieved in various ways that are within the skill in the art, for instance, using 
publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNA STAR) 
software. Those skilled in the an can determine appropriate parameters for measuring alignment, including 
any algorithms needed to achieve maximal alignment over the full length of the sequences being compared. 
"Percent (%) nucleic acid sequence identity" with respect to the vertebrate fused sequences identified herein 
is defined as the percentage of nucleotides in a candidate sequence that are identical with the nucleotides in 
the vertebrate /twerf sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the 
maximum percent sequence identity. Alignment for purposes of determining percent nucleic acid sequence 
identity can be achieved in various ways that are within the skill in the art, for instance, using publicly 
available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. Those 
skilled in the an can determine appropriate parameters for measuring alignment, including any algorithms 
needed to achieve maximal alignment over the full length of the sequences being compared. 

The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising vertebrate 
fused polypeptide, or a portion thereof, fused to a "tag polypeptide". The tag polypeptide has. enough 
residues to provide an epitope against which an antibody can be made, yet is short enough such that it does 
not interfere with activity of the vertebrate fused polypeptide. The tag polypeptide preferably also is fairly 
unique so that the antibody does not substantially cross-react with other epitopes. Suitable tag polypeptides 
generally have at least six amino acid residues and usually between about 8 to about 50 amino acid residues 
(preferably, between about 1 0 to about 20 residues). 

As used herein, the term '"immunoadhesin*" designates antibody-like molecules which combine the 
binding specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin 
constant domains. Structurally, the immunoadhesin comprise a fusion of an amino acid sequence with the 
desired binding specificity which is other than the antigen recognition and binding site of an antibody (i.e.. is 
"heterologous"), and an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin 
molecule typically is a contiguous amino acid sequence comprising at least the binding site of a receptor or a 
ligand. The immunoglobulin constant domain sequence in the immunoadhesins may be obtained from anv 
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immunoglobulin, such as lgG-1. lgG-2. IgG-3 or lgG-4 subtypes. IgA (including IgA-l and lgA-2. IgE. lgD 
or IsM. Immunoadhesion reponed in the literature include fusions of the T cell receptor - [Gascoigne et al. 
Proc. Natl. Acad. Sci. USA 84: 2936-2940 (1987)]; CD4" [Capron et al.. Nature 337: 525-531 (1989); 
Traunecker et al. Nature 339: 68-70 (1989): Zerrmeiss! et al. DNA Cell Biol. USA 9: 347-353 (1990); Byrn 
5 et al. Nature 344, 667-670 (1990)]; L-selectin (homing receptor) [Watson et al, J. Cell Biol. J_Kh2221-2229 

(1990) ; Watson etaLNaJure 349, 164-167 (1991)]; CD44" [Aruffo et al. Cell 61, 1303-1313 (1990)]; CD28 - 
and B7" [Linsleye/ al. J. Exp. Med. J73, 721-730 (1991)]; CTLA-4" [Lisley et al. J. Exp. Med. \J±, 561-569 

(1991) ]; CD22- [Stamenkovic et al. Cell 66. 1 133-1 144 (1991)]; TNF receptor [Ashkenazie/ al, Proc. Natl. 
AcadSci. USA 88, 10535-10539(1991); Lessiauere/ al, Eur. J. Immunol 27, 2883-2886(1991); Peppelc/a/., 

10 J. Exp. Med. 174, 1483-1489 (1991)]; NP receptors [Bennen et al, J. Biol. Chem. 266, 23060-23067 (1991)]; 
IgE receptor a-chain' [Ridgway and Gorman, J. Cell. Biol. 115, abstr. 1448 ( 1991 )]; HGF receptor [Mark, M.R. 
et al, 1992, submitted !, where the asterisk f) indicates that the receptor is member of the immunoglobulin 
superfamily. 

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, 
15 and generally is an empirical calculation dependent upon probe length, washing temperature, and salt 
concentration, in general, longer probes require higher temperatures for proper annealing, while shorter 
probes need lower temperatures. Hybridization generally depends upon the ability of denatured DNA to 
rea:- )eal when complementary strands are present in an environment near but below their T™ (melting 
temperature). The higher the degree of desired homology between the probe and hybridizable sequence, the 
20 higher the relative temperature which can be used. As a result, it follows that higher relative temperatures 
would tend to make the reaction conditions more stringent, while lower temperatures less so. Moreover, 
stringency is also inversely proportional to salt concentrations. For additional details and explanation of 
stringency of hybridization reactions, see Ausubel et al. Current Protocols in Molecular Biology (1995). 

"Stringent conditions." as defined herein may be identified by those that: (I) employ low ionic 
25 strength and high temperature for washing, for example 0.015 M sodium chloride/0.0015 M sodium 
citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during hybridization a denaturing agent, such as 
formamide, for example, 50% (vol/vol) formamide with 0.1% bovine serum albumin/0.1% FicoWO.1% 
polyvinylpurrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM 
sodium citrate at 42°C; (3) employ 50% formamide. 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM 
30 sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardfs solution, sonicated salmon sperm 
DNA (50 ug/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC (sodium 
chloride/sodium citrate) and 50% formamide at 55°C, followed by a high-stringency wash consisting of 0.1 x 
SSC containing EDTA at 55°C. 

"Moderately stringent conditions" may be identified as described by Sambrook et al. Molecular 
35 Cloning: A Laboratory- Manual (New York: Cold Spring Harbor Laboratory Press, 1989), and include the 
use of a washing solution and hybridization conditions (e.g., temperature, ionic strength and %SDS) less 
stringent than described above. An example of moderately stringent conditions is a condition such as 
overnight incubation at 37°C in a solution comprising: 20% formamide. 5 x SSC (150 mM NaCl. 15 mM 
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trisodium citrate). 50 mM sodium phosphate (pH 7.6). 5 x Denhardt's solution. 10% dextran sulfate, and 20 
mg/mL denatured sheared salmon sperm DNA. followed by washing the filters in 1 x SSC at about 37-50°C. 
The skilled artisan will recognize how to adjust the temperature, ionic strength, etc. as necessary to 
accommodate factors such as probe length and the like. 

"Isolated." when used to describe the various polypeptides disclosed herein, means polypeptide that 
has been identified and separated and/or recovered from a component of its natural environment. 
Contaminant components of its natural environment are materials that would typically interfere with 
diagnostic or therapeutic uses for the polypeptide, and may include enzymes, hormones, and other 
proteinaceous or non-proteinaceous solutes, in preferred embodiments, the polypeptide will be purified (I) 
to a degree sufficient to obtain at least ^residues of N-terminal or internal amino acid sequence by use of a 
spinning cup sequenator, or (2) to homogeneity by SDS-PAGE under non-reducing or reducing conditions 
using Coomassie blue or, preferably, silver stain. Isolated polypeptide includes polypeptide in situ within 
recombinant cells, since at least one component of the vertebrate fused natural environment will not be 
present. Ordinarily, however, isolated polypeptide will be prepared by at least one purification step. 

An "isolated" vertebrate fused nucleic acid molecule is a nucleic acid molecule that is identified and 
separated from at least one contaminant nucleic acid molecule with which it is ordinarily associated in the 
natural source of the vertebrate fused nucleic acid. An isolated vertebrate fused nucleic acid molecule is 
other than in the form or setting in which it is found in nature. Isolated vertebrate fused nucleic acid 
molecules therefore are distinguished from the corresponding native vertebrate fused nucleic acid molecule 
as it exists in natural cells. 

The term "control sequences" refers to DNA sequences necessary for the expression of an operably 
linked coding sequence in a particular host organism. The control sequences that are suitable for 
prokaryotes. for example, include a promoter, optionally an operator sequence, and a ribosome binding site. 
Eukaryotic cells are known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another 
nucleic acid sequence. For example, DNA for a presequence or secretory leader is operabiy linked to DNA 
for a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a 
promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or 
a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate 
translation. Generally, "operably linked" means that the DNA sequences being linked are contiguous, and, in 
the case of a secretory leader, contiguous and in reading phase. However, enhancers do not have to be 
contiguous. Linking is accomplished by ligation at convenient restriction sites. If such sites do not exist, the 
synthetic oligonucleotide adaptors or linkers are used in accordance with conventional practice. 

The term ' antibody" is used in the broadest sense and specifically covers single monoclonal 
antibodies (including agonist and antagonist antibodies), antibody compositions with polyeptopic specificity, 
as well as antibody fragments (e.g., Fab, F(ab'), and Fv). so long as they exhibit the desired biological 
activity. 

The term "monoclonal antibody" as used herein refers to an antibody obtained from a population of 
substantially homogeneous antibodies, i.e., the individual antibodies comprising the population are identical 
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except for possible naturally occurring mutations that may be present in minor amounts. Monoclonal 
antibodies are highly specific, being directed against a single antigenic site. Furthermore, in contrast to 
conventional (polyclonal) antibody preparations that typically include different antibodies directed agatnst 
different determinants (epitopes), each monoclonal antibody is directed against a single determinant on the 
5 antigen. In addition to their specificity, the monoclonal antibodies are advantageous in that they are 
synthesized by the hybridoma culture, uncontaminated by other immunoglobulins. The modifier 
••monoclonal" indicates the character of the antibody as being obtained from a substantially homogeneous 
population of antibodies, and is not to be construed as requiring production of the antibody by any particular 
method. For example, the monoclonal antibodies to be used in accordance with the present inventton may be 
10 made by the hybridoma method first described by Kohler & Milstein, Nature 256:495 (1975). or may be 
made by recombinant DNA methods [see. e.g. U.S. Patent No. 4,816,567 (Cabilly et o/.)]. 

The monoclonal antibodies herein specifically include "chimeric" antibodies (immunoglobulins) in 
which a portion of the heavy and/or light chain is identical with or homologous to corresponding sequences 
in antibodies derived from a particular spec.es or belonging to a particular antibody class or subclass, while 
15 the remainder of the chain(s) is identical with or homologous to corresponding sequences in antibodies 
derived from another species or belonging to another antibody class or subclass, as well as fragments of such 
antibodies, so long as they exhibit the desired biological activity [U.S. Patent No. 4,816.567; Cabilly el ai: 
Morrison et al Proc Natl. Acad. Sci. USA 81. 6851-6855 (1984)]. 

"Humanized" forms of non-human (e.g. murine) antibodies are chimeric immunoglobulins, 
20 .mmunoglobulin chains or fragments thereof (such as Fv, Fab. Fab, F(ab') 2 or other ant.gen-binding 
subsequences of antibodies) which conta.n minimal sequence derived from non-human immunoglobulin. 
For the most part, humanized antibodies are human immunoglobulins (recipient antibody) in which residues 
from a complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a 
non-human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and 
25 capacity. In some instances, corresponding non-human residues replace Fv framework residues of the 
human immunoglobulin. Furthermore, humanized antibody may comprise residues that are found neither in 
the recipient antibody nor in the imported CDR or framework sequences. These modifications are made to 
further refine and optimize antibody performance. In general, the humanized antibody will comprise 
substantially all of at least one. and typically two, variable domains, in which all or substantially all of the 
30 CDR regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR 
regions are those of a human immunoglobulin consensus sequence. The humanized antibody optimally also 
will comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin. For further details see: Jones e, at.. Nature 321- 522-525 (1986): Reichmann et at., Nature 
332, 323-329 (1988); Presta. r„rr. On. Struct. Biol. 2 593-596 (1992) and U.S. Patent No. 5.225.539 
35 (Winter) issued July 6, 1 993. 

- — - - "Active" or "activity" for the purposes herein refers to" form(s) of vertebrate fused which retain the 

biologic and/or immunologic activities of native or naturally occurring vertebrate fused. A preferred activity 
is the ability to bind to and affect, e.g., block or otherwise modulate, hedgehog signaling. The activity 
preferably involves the regulation of the pathogenesis of Basal cell carcinoma. Another preferred biological 

10 



SDOCID: <WO_9943828A1 J_> 



WO 99/43828 PCT/US99/04 1 1 2 

activity is the ability to phosphorylate or modulate the phosphorylation of Gli. 

The term "antagonist" is used herein in the broadest sense to include any molecule which blocks, 
prevents, inhibits, neutralizes the normal functioning of fused in the Hh signaling pathway. One particular 
form of antagonist includes a molecule that interferes with the interaction between fused and its binding or 
complexing proteins. In a similar manner, the term "agonist" is used herein to include any molecule which 
promotes, enhances or stimulates the normal functioning of fused in the Hh signaling pathway. Suitable 
molecules that affect the protein-protein interaction of fused and its binding proteins include fragments of the 
latter or small bioorganic molecules, e.g., peptidomimetics. which will prevent or enhance, as the case may 
be, the interaction of proper complex formation. Non-limiting examples include proteins, peptides, 
glycoproteins, glycopeptides, glycolipids, polysaccharides, oligosaccharides, nucleic acids, bioorganic 
molecules, peptidomimetics, pharmacological agents and their metabolites, transcriptional and translation 
control sequences, and the like. Another preferred form of antagonist includes antisense nucleotides that 
inhibit proper transcription of wild type fused. Preferred forms of antagonists and are small molecules, 
which specifically bind to or block binding of the ATP binding site of fused. 

The term - modulation" or 'modulating" means upregulation or downregulation of a signaling 
pathway. Cellular processes under the control of signal transduction may include, but are not limited to, 
transcription of specific genes: normal cellular functions, such as metabolism, proliferation, differentiation, 
adhesion, apoptosis and survival, as well as abnormal processes, such as transformation, blocking of 
differentiation and metastasis. 

The techniques of "polymerase chain reaction," or "PCR", as used herein generally refers to a 
procedure wherein minute amounts of a specific piece of nucleic acid, RNA and/or DNA are amplified as 
described in U.S. Pat. No. 4.683,195 issued 28 July 1987. Generally, sequence information from the ends of 
the region of interest or beyond needs to be available, such that oligonucleotide primers can be designed: 
these primer will be identical or similar in sequence to opposite strands of the template to be amplified. The 
5' terminal nucleotides of the two primers may coincide with the ends of the amplified material. PCR 
sequences form total genomic DNA. and cDNA transcribed from total cellular RNA, bacteriophage, or 
plasmid sequences, etc. See generally Mullis el ai. Cold Spring Harbor Symp. Quant. Biol. 5U 263 (1987); 
Erlich, Ed., PCR Technology, (Stockton Press, NY, 1989). As used herein. PCR is considered to be one, but 
not the only, example of a nucleic acid test sample comprising the use of a known nucleic acid as a primer 
and a nucleic acid polymerase to amplify or generate a specific piece of nucleic acid. 
". Compositions and Methods of the Invention 
A. Full-length vertebrate fused 

The present invention provides newly identified and isolated nucleotide sequences encoding 
polypeptides referred to in the present application as human and vertebrate fused. In particular. Applicants 
have identified and isolated cDNA encoding a vertebrate fused polypeptide, as disclosed in further detail in 
the Examples below. Using BLAST. BLAST- 2 and FastA sequence alignment computer programs. 
Applicants found that a full-length native sequence human fused (shown in Figure 3 (SEQ ID NO 2)) has 
28% amino acid sequence identity with Drosophila fused (SEQ ID NO 23). Accordingly, it is presently 
believed that the human fused disclosed in the present application is a newly identified member of the 



1 1 



PCT/US99/04112 

WO 99/43828 

hedeehoe signaling cascade. 

The full-length native sequence of human vertebrate /,W gene, or portions thereof, may be used as 
hvbridization processor a cDNA library to isolate the full-length gene or to isolate still other vertebrate 
homoioo oenes (for instance, those encoding naturally-occurring variants of vertebrate fused or vertebrate 

5 fused from other species) which have a des.red sequence identity to the vertebrate fused sequence disclosed 
in Fie 1 (SEQ ID NO 1). Optionally, the length of the probes will be about 20 to about 50 bases. The 
hvbr.dizat.on probes may be derived from the nuc.eot.de sequence of Fig. 1 (SEQ ID NO 1) or from 
genomic sequences .ncluding promoters, enhancer elements and mtrons of native sequence vertebrate /W. 
By way of example, a screening method will compr.se .solating the cod.ng region of the vertebrate /»W 

, 0 gene using the known DN A sequence to synthesize a selected probe of about 40 bases. Hybr.dization probes 
may be labeled by a variety of labels, inc.ud.ng radionuc.eotides such as 32 P or 35 S. or enzymatic labels such 
as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems. Labeled probes having a 
sequence complementary to that of the vertebrate fused gene of the present invent.on can be used to screen 
Hbraries of human cDNA, genom.c DNA or mRJs'A to determine which members of such libraries the probe 

1 5 hybridizes to. 

B. Vertebrate fused Variants 

,„ addition to the full-length native sequence vertebrate fused described herein, it is contemplated 
that vertebrate fused variants can be prepared. Vertebrate fused variants can be prepared by introducing 
appropriate nucleotide changes into a known vertebrate/.^ DNA. or by synthesis of the des.red vertebrate 
20 fused po.ypept.de, Those skilled in the art will appreciate that am.no acid changes may alter post- 
translational processes of the vertebrate fused. 

Variations in the native full-length sequence vertebrate/.^ or in various domains of the vertebrate 
fused described herein, can be made, for example. ,:s,ng any of the techn.ques and guidelines for 
conservative and non-conservat.ve mutations set forth, for .nstance. in U.S. Patent No. 5,364,934. Var.at.ons 
, 5 may be a substitution, deletion or insertion of one or more codons encod.ng the vertebrate/^ that results 
,n a chanse in the am.no acid sequence of the vertebrate fused as compared with the native sequence 
vertebrate/.^. Optionally the variation is by substitution of at least one am.no acid with any other ammo 
acid in one or more of the domains of the vertebrate fused. Guidance in determ.n.ng which amino ac.d 
residue may be inserted, substituted or deleted without adversely affecting the desired activity may be found 
30 by comprint the sequence of the vertebrate fused with that of homologous known protein molecules and 
minimize the number of amino acid sequence changes made in regions of high homology. Am.no ac.d 
substitutions can be the result of replacing one amino acid with another ammo acid having similar structural 
and/or chemica. propert.es, such as the replacement of a leucine with a ser.ne. i.e., conservative amino ac.d 
replacements. Insertions or deletions may opt.onally be in the range of 1 to 5 amino acids. The var.at.on 
35 allowed may be determined by systematically making insertions, deletions or substitutions of amino acids ,n 
"" the sequence and testing the resulting variants for activity in the in vitro assay described in the Examples 
below. 

The variat.ons can be made using methods known in the art such as oligonucleotide-mediated (s.te- 
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directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutasenesis [Carter et al., 
Nucl. Acids Res. . 13:4331 (1986); Zoiler et al.. Nucl. Acids Res. . 10:6487 (1987)], cassette mutagenesis 
[Wells et al.. Gene . 34:315 (1985)], restriction selection mutagenesis [Wells et al.. Philos. Trans. R. Soc. 
London SerA . 3J7:4 15 (1986)1 or other known techniques can be performed on the cloned DNA to produce 
the vertebrate fused variant DNA. 

Scanning amino acid analysis can also be employed to identify one or more amino acids along a 
contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. 
Such amino acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning 
amino acid among this group because it eliminates the side-chain beyond the beta-carbon and is less likely to 
alter the main-chain conformation of the variant. Alanine is also typically preferred because it is the most 
common amino acid. Further, it is frequently found in both buried and exposed positions [Creighton, The 
Proteins, (W.H. Freeman & Co., N. Y.); Chothia, J. Mo/. Biol,, 150: 1 ( 1 976)]. If alanine substitution does not 
yield adequate amounts of variant, an isoteric amino acid can be used. 

In the human fused sequence depicted in Figure I, the kinase domain is represented by amino acid 
residues 1-260 (SEQ ID NO 24) of which position lysine 33 appears to be necessary for ATP binding and 
•thus enzymatic activity. 

C. Modifications of vertebrate fused 

Covalent modifications of vertebrate fixed are included within the scope of this invention. One type 
of covalent modification includes reacting targeted amino acid residues of the vertebrate fused with an 
organic derivatizing agent that is capable of reacting with selected side chains or the N- or C- terminal 
residues of the vertebrate fused. Derealization with Afunctional agents is useful, for instance, for 
crosslinking vertebrate fused to a water-insoluble support matrix or surface for use in the method for 
purifying anti-vertebrate fused antibodies, and vice-versa. Commonly used crosslinking agents include, e.g., 
I,l-bis(diazo-acetyl)-2-phenylethane, glutaraldehyde. N-hydroxy-succinimide esters, for example, esters 
with 4-azido-salicylic acid, homobifunctional imidoesters. including disuccinimidyl esters such as 3.3'- 
dithiobis-(succinimidyl-propionate), Afunctional maieimides such as bis-N-maleimido-l,8-octane and agents 
such as methyl-3-[(p-azidophenyl)-dithio]pro-pioimi-date. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to the 
corresponding glutamyl and asparryl residues, respectively, hydroxylation of proline and lysine, 
phosphorylation of hydroxy! groups of seryl or threonyl residues, methylation of the a-amino groups of 
lysine, arginine, and histidine side chains [T.E. Creighton. Proteins: Structure and Molecular Properties . 
W.H. Freeman & Co.. San Francisco, pp. 79-86 (1983)], acetylation of the N-tenrtinal amine, and amidation 
of any C-terminal carboxyl group. 

Another type of covalent modification of vertebrate fused comprises iinking the vertebrate fused 
polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene glycol, polypropylene 
glycol, or polyoxyalkylenes. in the manner set forth in U.S. Patent Nos. 4,640.835 : 4.496,689; 4,301,144; 
4,670.417; 4.791,192 or 4.179,337. Such modifications would be expected in increase the haif-life of the 
molecules in circulation in a mammalian system; Extended half-life of fused molecules might be useful under 
certain circumstances, such as where the fused variant is administered as a therapeutic agent. 
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The vertebrate fused of the present invention may also be modified in a way to form a chimeric 
molecule comprising vertebrate fused bonded to another, heterologous polypeptide or amino acid sequence. 
In one embodiment, such a chimeric molecule comprises a fusion of the vertebrate fused with a tag 
polypeptide, which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the amino- or carboxyl- terminus of the vertebrate/^. The presence of such epitope- 
tagsed forms of the vertebrate fused can be detected us.ng an antibody against the tag polypeptide. Also. 
provis.cn of the epitope tag enables the vertebrate fused to be readily purified by affinity purification us.ng 
an ant.-tag antibody or another type of affinity matrix that binds to the epitope tag. In an alternative 
embodiment, the chimeric molecule may comprise a fusion of the vertebrate fused w.th an immunoglobulin 
or a particular region of an immunoglobulin. For a bivalent form of the chimenc molecule, such a fusion 
could be to the Fc region of an lgG molecule. 

Ordinanly.theC-terminusof a contiguous amino acid sequence of a l.gand-(IFN-Y-) binding domain of an IFN- 
v receptor is fused to the N-terminus of a contiguous amino acid sequence of an immunoglobulin constant 
resion. in place of the variable region(s), however N-terminal fusions are also possible. 

Typically, such fusions reta.n at least functionally active hinge, CH2 and CH3 domains of the constant 
region of an immunoglobulin heavy chain. Fusions are also made to the C-terminus of the Fc portion of a 
constant domain, or immediately N-term.nal to the CH 1 of the heavy chain or the corresponding region of the 
light chain. This ordinarily is accomplished by constructingthe appropriate DNA sequence and expressing it in 
recombinant cell culture. Altematively,immunoadhesinsmay be synthesized according to known methods. 

The precise site at which the fusion is made is not critical; particular sites are well known and may be 
selected in orderto optimize the biological activity, secretion or binding characteristicsof the immunoadhesins. 

In a preferred embodiment, the C-terminus of a contiguous amino acid sequence which comprises the 
binding site(s) for IFN-y is fused, at the N-terminal end. to the C-terminal portion of an antibody (in particular 
theFc domain), containing the effector functions of an immunoglobulin, e.g. immunoglobulin G, (IgG-l). As 
hereinabove mentioned, h is possible to fuse the entire heavy chain constant region to the sequence containing 
the binding site(s). However, more preferably, a sequence beginning in the hinge region just upstream of the 
papain cleavage site (which defines IgG Fc chemically; residue 216, taking the first residue of heavy chain 
constant region to be 1 14 [Kobet et ai, supra], or analogous sites of other immunoglobulins) is used in the 
fusion. Although it was earlier thought that in immunoadhesins the immunoglobulin light chain would be 
required for efficient secretion of the heterologous protein-heavy chain fusion proteins, it has been found that 
even the immunoadhesinscontaining the whole IgG I heavy chain are efficiently secreted in the absence of light 
chain. Since the light chain is unnecessary, the immunoglobulin heavy chain constant domain sequence used in 
the construction of the immunoadhesins of the present invention may be devoid of a light chain binding site. 
This can be achieved by removing or sufficiently altering immunoglobulin heavy chain sequence elements to 
which the light chain is ordinarily linked so that such binding is no longer possible. Thus, the CH 1 domain can 
be entirely removed in certain embodiments of the IFN-y receptor-immunoglobulinchimeras. 

In a particularly preferred embodiment, the amino acid sequence containing the extracellulardomain of 
an IFN-y receptor is fused to the hinge region and CH2. CH3; or CHI. hinge, CH2 and CH3 domainsof an IgG- 
1 IgG-2. IeG-3. or lgG-4 heavy chain. The construction of a typical structure is disclosed in Example I. 
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In some embodiments, the IFN-y receptor-immunoglobulin molecules (immunoadhesins) are 
assembled as monomers, dimers or multimers, and particularly as dimers or tetramers. Generally, these 
assembled immunoadhesins will have known unit structures similar to those of the corresponding 
immunoglobulins. A basic four chain structural unit (a dimer of two immunoglobulin heavy chain-light chain 
5 pairs) is the form in which IgG, IgA and IgE exist. A four chain unit is repeated in the high molecular weight 
immunoglobuIins;IgM generallvexists as a pentamer of basic four-chain units held together by disulfide bonds. 
IgA globulin, and occasionally IgG globulin, may also exist in a multimeric form in serum. In the case of 
multimers. each four chain unit may be the same or different. 

It is not necessary that the entire immunoglobulin portion of the IFN-y receptor-immunoglobulin 

10 chimeras be from the same immunoglobulin. Various portions of different immunoglobulins may be combined, 
and variants and derivatives of native immunoglobulins can be made as hereinabove described with respect to 
IFN-y, in order to optimize the properties of the immunoadhesin molecules. For example, immunoadhesin 
constructs in which the hinge of IgG- 1 was replaced with that of IgG-3 were found to be functional and showed 
pharmacokineticscomparabie to those of immunoadhesinscomprisingthe entire IgG- 1 heavy chain. 

15 Various tag polypeptides and their respective antibodies are well known in the an. Examples 

include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag polypeptide and 
its antibody I2CA5 [Field et al.. Mol. Cell. Biol. . 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 
6E10, G4, B7 and 9EI0 antibodies thereto [Evan et al., Molecular and Cellular Biology . 5:36)0-3616 
(1985)]; and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky et al., Protein 

20 Engineering , 3(6):547-553 (1990)]. Other tag polypeptides include the Flag-peptide [Hopp et al., 
BioTechnology , 6:1204-1210 (1988)]; the KT3 epitope peptide [Martin et al.. Science , 255:192-194 (1992)]; 
an a-tubulin epitope peptide [Skinner et al., J. Biol. Chem. . 266:15163-15166 (1991)]; and the T7 gene 10 
protein peptide tag [Lutz-Freyermuth et al., Proc. Natl. Acad. Sci. USA , 87:6393-6397 (1990)]. A preferred 
tag is the influenza HA tag. 

25 D. Preparation of vertebrate fused 

The description below relates primarily to production of a particular vertebrate fused by cuituring 
cells transformed or transfected with a vector containing vertebrate fused nucleic acid. It is, of course, 
contemplated that alternative methods, which are well known in the an, may be employed to prepare 
vertebrate fused. For instance, the vertebrate fused sequence, or portions thereof, may be produced by direct 

30 peptide synthesis using solid-phase techniques [see. e.g., Stewart et al., Solid-Phase Peptide Synthesis , W.H. 
Freeman Co., San Francisco, CA (1969); Merrifield, J. Am. Chem. Soc , 85:2149-2154 (1963)]. In vitro 
protein synthesis may be performed using manual techniques or by automation. Automated synthesis may be 
accomplished, for instance, using an Applied Biosystems Peptide Synthesizer (Foster City, CA) using 
manufacturer's instructions. Various portions of the vertebrate fused may be chemically synthesized 

35 separately and combined using chemical or enzymatic methods to produce the full-length vertebrate fused. 
1. Isolation of DNA Encoding vertebrate fused 

DNA encoding vertebrate fused may be obtained from a cDNA library prepared from tissue 
believed to possess the vertebrate fused mRNA and to express it at a detectable level. Accordingly, human 
vertebrate fused DNA can be conveniently obtained from a cDNA library prepared from human tissue, such 
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as described in the Examples. The vertebrate /^-encoding gene may also be obtained from a genomic 
library or by oligonucleotide synthesis. 

Libraries can be screened with probes (such as antibodies to the vertebrate or oligonucleotides 
of at least about 20-80 bases) designed to identify the gene of interest or the protein encoded by it. Screening 
the cDNA or genomic library with the selected probe may be conducted using standard procedures, such as 
described in Sambrook et al., Molecular CloninP. A Laborato ry Manual (New York: Cold Spring Harbor 
Laboratory Press. 1989). An alternative means to isolate the gene encoding vertebrate fused is to use PCR 
methodology [Sambrook et aL supra; Dieffenbach et aL, PCR Primer: A Laboratory Manual (Cold Spring 

Harbor Laboratory Press, 1995)]. 

The Examples below describe techniques for screening a cDNA library. The oligonucleotide 
sequences selected as probes should be of sufficient length and sufficiently unambiguous that false positives 
are minimized. The oligonucleotide is preferably labeled such that it can be detected upon hybridization- to 
DNA in the library being screened. Methods of labeling are well known in the art, and include the use of 
radiolabels like 32 P-labeled ATP, biotinylation or enzyme labeling. Hybridization conditions, including 
1 5 moderate stringency and high stringency, are provided in Sambrook et aL, supra. 

Sequences identified in such library screening methods can be compared and aligned to other 
known sequences deposited and available in public databases such as GenBank or other private sequence 
databases. Sequence identity (at either the amino acid or nucleotide level) within defined regions o' the 
molecule or across the full-length sequence can be determined through sequence alignment using computer 
20 software programs such as BLAST. BLAST-2, ALIGN, DNAstar. and INHERIT which employ various 
algorithms to measure homology. 

Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or 
genomic libraries using the deduced amino acid sequence disclosed herein for the first time, and, if 
necessary, using conventional primer extension procedures as described in Sambrook et a)., supra, to detect 
25 precursors and processing intermediates of mRN A that may not have been reverse-transcribed into cDNA. 
2. Selection and Transformation of Host Cells 

Host cells are transfected or transformed with expression or cloning vectors described 
herein for vertebrate fused production and cultured in conventional nutrient media modified as appropriate 
for inducing promoters, selecting transformants, or amplifying the genes encoding the desired sequences. 

30 The culture conditions, such as media, temperature, P H and the like, can be selected by the skilled artisan 
without undue experimentation. In general, principles, protocols, and practical techniques for maximizing 
the productivity of cell cultures can be found in Mammalian Cell Biotechnology: A Practical Approach, M. 
Butler, ed. (1RL Press, 1991) and Sambrook et al., supra. 

Methods of transfection are known to the ordinarily skilled artisan, for example, CaPO, and 

35 electroporation. Depending on the host cell used, transformation is performed using standard techniques 
appropriate to such cells. The calcium treatment employing calcium chloride, as described in Sambrook et 
al., supra, or electroporation is generally used for prokaryotes or other cells that contain substantial cell-wall 
barriers. Infection with Agrobacterium tumefaciens is used for transformation of certain plant cells, as 
described by Shaw et al.. Gene, 23:3 15 (1983) and WO 89/05859 published 29 June 1989. For mammalian 
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cells without such cell walls, the calc.um phosphate precipitation method of Graham and van der Eb. 
Virology 52:456-457 (1978) can be employed. General aspects of mammalian cell host system 
transformations have been described in U.S. Patent No. 4.399.216. Transformations into yeast are typically 
carried out according to the method of Van Solingen et a/.. J. Baa.. 130:946 (1977) and Hstao et ai. Proc. 
» Na,L Acad. Sci. (USA). 76:3829 ( 1 979). However, other methods for introducing DNA into cells, such as by 
nuclear microinjection, electroporation. bacterial protoplast fusion with intact cells, or polycations, e.g., 
polybrene, polyomithine. may also be used. For various techniques for transformmg mammalian cells, see 
Keown et ai. Methods m Enzymology, 185:527-537 (1990) and Mansour et a/.. Nature. 336:348-352 (1988). 

Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, 
yeast, or h.gher eukaryote cells. Sunable prokaryotes include but are not limited to eubacteria, such as 
Gram-negative or Gram-positive orgamsms. for example, Enterobacteriae such as E coli. Various £. coli 
strains are publicly available, such as E. coli K12 strain MM294 (ATCC 31,446); E. coli XI 776 (ATCC 
31,537); E. constrain W3110(ATCC 27,325) and K5 772 (ATCC 53.635). 

In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable 
cloning or expression hosts for vertebrate/^-encoding vectors. Saccharomyces cerevisiae is a commonly 
used lower eukaryotic host microorganism. 

Suitable host cells for the expression of vertebrate/^ are derived from multicellular organisms. 
Examples of invertebrate cells include insect cells such as Drosophila S2 and Spodopterr Sf9, as well as 
plant cells. Examples of useful mammalian host cell lines include Chinese hamster ovary (CHO) and COS 
cells. More specific examples include monkey kidney CVI line transformed by SV40 (COS-7, ATCC CRL 
1651); human embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture, Graham 
et ai. J. Gen ViroL, 36:59 (1977)); Chinese hamster ovary cellsZ-DHFR (CHO, Urlaub and Chasin, Proc. 
Natl. Acad. Sci. USA, 77:4216 (1980)); mouse Sertoli cells (TM4. Mather, Biol. Reprod. 23:243-251 (1980)); 
human lung ceils (WI38, ATCC CCL 75); human liver ceils (Hep G2. HB 8065); and mouse mammary' 
tumor (MMT 060562. ATCC CCL51). The selection of the appropriate host cell is deemed to be within the 
skill in the art. 

3. Selection and Use of a Replicable Vecrm- 

The nucleic acid (e.g., cDNA or genomic DNA) encoding vertebrate fused may be inserted 
into a replicable vector for cloning (amplification of the DNA) or for expression. Various vectors are 
publicly available. The vector may, for example, be in the form of a plasmid, cosmid, viral particle, or 
phage. The appropriate nucleic acid sequence may be inserted into the vector by a variety of procedures. In 
general, DNA is inserted into an appropriate restriction endonuclease site(s) using techniques known in the 
art. Vector components generally include, but are not limited to. one or more of a signal sequence, an ongin 
of replication, one or more marker genes, an enhancer element, a promoter, and a transcription termination 
sequence. Construction of suitable vectors containing one or more of these components employs standard 
ligation techniques, which are known to the skilled artisan. 

Both expression and cloning vectors contain a nucleic acid sequence tha't enables the vector to 
replicate in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, 
and viruses. The origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, 
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,„e ,. p.asmid origin is sui.able f». yeas, and various .Ira, origins (SV40. polyoma, adenovirus. «V or 
B pv") are useful fo, Coning vectors in —Han cells. A prefab r.plieable expression veco, ,s ,ha 
plasm id is pRKS. Holmes ei al.. Science, 253: 1 278- 1 280 ( 1 99 1 ) 

Expression and Con™ vec,o,s will typically contain a selection gene, also ,ermed a staple 
made., Typical selecion genes encode proteins ,ba, (a) confer resistance ,0 antibiotics or otbe, toxms. e.g., 
ampicillin. neomycin. me,bo,rexa,e. 0, ,e,rac,e,ine. <b, complement auxonopbic d.f.ciencie, ., (c, supply 
cr ,,,ca, nutrients no, available from complex media. ,,. me gene encoding D-al»n,ne racem.se for * 

An example of snh.bl. selectable markers fo, mammalia cells are ,bose mat enable ,h. 
Identification of cells compe.en, ,0 take np tbe vertebra,, fi* nncleic acid, sucb as DHFR or mym* i.e 
ki „, s ,. A „ app,opr,a,e bos, «„ wben »ild-,vpe DHFR Is employed is me CHO eel, „e detenu DHER 
activity, prepared and propagated as deseed by Urfaub e, al. tree. «. Sc. US,, JJC.. (. W 
A sole selecon gene for ns. , yeas, ,s me „, gene pr.sen, in me yeas, plasntid VRp S « om^ ,e, 
a, 282:39 (1979); Kingsman e, al. Gene, 7:14, (1979); Tscbempe, e, al. Gene. 10.157 (.980,,. 

Tbe , P , genTprovides , selecrion marker fo, a mu.ap, stram of yeas, lacking me abHity ,» grow ■» 
S „p,„phan.fo,ex„p,e.ATCCNo.44076orPEP4.IUones,Ge«„c»,85:l2(1977) ) . 

Expression and Coning vec.ors usual,, c.main a promo.e, operably linked ,o ,h. vertebra,. Ji-f 
nucleic acid sentence ,o diree, mRNA syndesis. Promo.ers recognized by a vanery of po.ent.a, bos, cells 
are we,, known. Promos suitable for ose wi,b prokarymic bosts include ,ne p-lactamase and lac.os 

p„mo,e, sys,ems ,Cba„g e, a,.. M« 225-6.5 09,8,: Ooedde, . al. Ualace, 28,-4 (» « ^ 
n „ ml „romo,ers«s,emlGoeddel,««e/ei^c,& Ties,. 8:4057 (1980); EP ob.7761, 
:0 phosphamse. a nyplophan (trp) promoier system ,uo ,„.„ , 5 ,,983)1 

and bybrid promo.ers speb as ,b. „c promo,., (deBoer . al. Proc ml Ac* Sc, USA. 80.-1 5 ,98* 
Pr„mo,ers for use in bac.ria, sysrerr, a,so w.„ conum a Sbine-Dalgam. (S.D., seou.nce operably Imked ,o 

the DNA encoding vertebrate fused. „ 
Examples of sunable promo.ing seances for nse wi,h yeas, bos.s include Ibe promo.ers fo, o- 
, 5 pbospbonlvcerate kinase (Hitzeman „ *. J- BU C*., 255:2073 (1980), o, o.ber glycolytic enzymes 
Hes e.'a, J- A*- «e,. 7:149 (,968V, Holland. » 17:4900 ,1978,,, socn as eno.ase. 

, y eem,de„,de.3-pbosph.,e debydrngenase. bexpklpase, pyruvate decarboxylase, pbospbo-fru— 
g ,L.se-6.pbospba,e isomerase. 3.pb„spbog,yee ra te mo.ase, pyruvate kinase, .riosepbospba.e .somerasc, 
phosphoglucose isomerase, and glucokinase. 

Ome, yeas, promoters, wbleb are mduclble promo.ers bavmg .be add,„ona, advantage of 
transcription conned by gr„w,b conditions. a,e me promoter reg.ons for alcobo, «~ J 
.soeyrolom. C.'aCd pbospba.ase. degradativ. enzymes associa.ea wl.b n.trogen — 
memllo.blon.u, s ,vceraldeb,de-3-pb.spba,. debydrogenase. and enzymes respons.b, for m. os. » 
galactose o.illzabon. Soluble vectors and promot.rs fo, nse in yeas, expression are former descbed 

^ 73 ' 6 " Venebmte^ transcription «. *«* * «— »» «" * 

- promotets obtained from , be genomes of viruses sucb as polyoma virus. fo»,pox virus (UK 2_„.5« 
pnbllsbed 5 3o,y ,989). ad.noviros (soeh as Adenoviros 2). bovine papilloma .In,, av.an sarcoma v.ros. 
vrome-aloviros. , reboviros. bepa,i,is-B viros and Simian V.ros 40 (SV40). from bcerologons mammaban 
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promoters, e.g.. the actin promoter or an immunoglobulin promoter, and from heat-shock promoters, 
provided such promoters are compatible with the host cell systems. 

Inserting an enhancer sequence into the vector may increase transcription of a DNA encoding the 
vertebrate fused by higher eukaryotes. Enhancers are cis-acting elements of DNA. usually about from 10 to 
300 bp, that act on a promoter to increase its transcription. Many enhancer sequences are now known from 
mammalian genes (globin. elastase, albumin, a-fetoprotein. and insulin). Typically, however, one will use 
an enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the 
replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the 
late side of the replication origin, and adenovirus enhancers. The enhancer may be spliced into the vector at 
a position 5' or 3' to the vertebrate /use* coding sequence, but is preferably located at a site 5' from the 
promoter. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or 
nucleated cells from other multicellular organisms) will also contain sequences necessary for the termination 
of transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5' and. 
occasionally 3', untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain 
nucleotide segments transcribed as poiyadenyiated fragments in the untranslated portion of the mRNA 
encoding vertebrate fused. 

Still other methods, vectors, and host ceils suitable for adaptation to the synthesis of vertebrate fused 
in recombinant vertebrate cell culture are described in Gething ei ai, Nature, 293:620-625 (1981); Mantei et 
al. Nature, 28J.:40-46 ( 1 979); EP 1 17,060; and EP 1 17,058. 

4 - Detecting Gene Amplification/Exp ression 

Gene amplification and/or expression may be measured in a sample directly, for example, 
by conventional Southern blotting. Northern blotting to quantitate the transcription of mRNA [Thomas, Pr OC , 
Natl. Acad. Sci. USA , 27:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, us.ng an 
appropriately labeled probe, based on the sequences provided herein. Alternatively, antibodies may be 
employed that can recognize specific duplexes, including DNA duplexes. RNA duplexes, and DNA-RNA 
hybrid duplexes or DNA-protein duplexes. The antibodies in turn may be labeled and the assay may be 
carried out where the duplex is bound to a surface, so that upon the formation of duplex on the surface, the 
presence of antibody bound to the duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as 
immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to quantitate 
directly the expression of gene product. Antibodies useful for immunohistochemical staining and/or assay of 
sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, 
the antibodies may be prepared against a native sequence vertebrate fused polypeptide or against a synthetic 
peptide based on the DNA sequences provided herein or against exogenous sequence fused to vertebrate 
fused DNA and encoding a specific antibody epitope. 

5. Purification of Polypeptide 

Forms of vertebrate/wed' may be recovered from host cell lysates. If membrane-bound, it 
can be released from the membrane using a suitable detergent solution (e.g. Triton-X 100) or by enzymatic 
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cleavage. Cells employed in express.cn of vertebrate/^ can be d.srupted by vanous physical or chem.cal 
means, such as freeze-thaw cycling, sonication, mechan.cal disruption, or ceil lysing agents. 

It may be desired to purify vertebrate /,W from recombinant cell proteins or polypeptides. The 
following procedures are exemplary of su.table purification procedures: by fractionation on an ion-exchange 
5 column; ethanol precipitation: reverse phase HPLC: chromatography on silica or on a canon-exchange resin 
such as DEAE: chromatofocusing; SDS-PAGE: ammonium sulfate precipitation; gel filtration using, for 
example, Sephadex G-75; prote.n A Sepharose columns to remove contaminants such as lgG: and metal 
chelating columns to bind epitope-tagged forms of the vertebrate fused. Various methods of protein 
punfication may be employed and such methods are known in the art and described for example m 
,0 Deutscher. Methods in F.nzvmoloev. 182 (1990): Scopes, Protein Purificat.on: Principles and Pract.ce , 
Springer-Verlag, New York (1982). The purification step(s) selected will depend, for example, on the nature 
of the production process used and the particular vertebrate fused produced. 
E. Uses for vertebrate fused 

(1) Fused is universal mediator ofHh signaling 
, 5 The humane full length molecule of (Fig. I (SEQ ID NO I )) encodes a protein with a 

predicted molecular weight of 150 kDa, which is significantly larger that DrosopMa fused (100 kDa, dfused 
(SEQ ID NO 23)). Human fused (hfused) shows notable homology to the Drosophila homologue in the 
kinase domain, but little homology with dfused or any other known protein over the remaining =1000 amino 
acids. The kinase domain extends from residue 1 to about resiaue 260, as ,s represented in Fig. 1 (SEQ ID 
?0 NOS 24 & 2). This divergence at the C-terminus of the molecules is unexpected given that the C-terminus of 
the Drosophila molecule is requ.red for its activity, Preat « al. Nature 347: 87-9 (1990). An ATP binding 
site is at about amino acid position 33 and is required for kinase activity. 

Prior stud.es in Drosophila ind.cate that dfused is necessary for Hh signal to occur but have not 
addressed the issue whether/^ is sufficient to act.vate this signaling system. As depicted in the Examples. 
.5 applicants have herein used a Gli DNA binding element present in the HNF3 P promoter, in front of a 
luciferase mediator of the Hh cascade, which clearly demonstrates that fused alone is capable of act.vat.ng 
Gli mediated transcription in this system. It is further apparent that both an mtact kinase domain and an 
intact C-terminal non-catalytic doma.n are required for this activation, which supports the notion that fused 
functions as a kinase and that the C-terminus may play a role in the substrate recognition or in regulating the 
30 kinase activity. 

Applicants have shown in the present application that hfused is a kinase which is capable of 
phosphorating artificial substrates such as MBP. However, the identity of the physiological substrate for 
h/«s«/ remains to be determined. One obv.ous candidate is G//-1 itself, as G//-1 phosphorylat.on by hfused 
can be detected in vitro. 

35 To determine if human fused is essential for Hh signaling in vertebrates, a mutant was constructed 

by altering a conserved lysine in the ATP bind.ng site (about amino acid residue 33). Typically, such 
mutants ac, as inhibitor of the corresponding wild type kinase by blocking access to substrate and/or 
regulatory factors, He et al. Nature 374, 617-22 (1995). When overexposed in 2-cell stage Xenopus 
embryos, the most remarkable phenotype was the presence of fused eyes in about 30% of the injected 
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embryos. Several lines of evidence indicate that this phenorype is likely to result from the inhibition of Hh 
signaling. First, SHh knockouts display a cyclopia phenotype attributed recently to mutations in the SHh 
gene. Chiang et al. Nature 383 : 407-13 (1996). Second, zebrafish embryos (cyclops) with reduced 
expression of SHh or injected with constirutively active form of PKA. a negative regulator of the Hh pathway 
5 are cyclops. Third. SHh, emanating from prechordal plate, has been shown to inhibit expression of Pax-6. a 
key transcription factor required for eye development, in the center of a continuous eyefield, Ekker et al.. 
Curr. Biol. 5: 944-55 (1995); Li et al. Development J24: 603-15 (1997); Macdonald et al.. Development 
121 : 3267-78 (1995). Finally, staining for Pax-6 embryos injected with fused-DN revealed a single field of 
expression suggesting a failure of SHh emanating from the prechordal plate to downregulate the expression 
1 0 of Pax-6 at the center of the eyefield. 

To confirm the position of fused in the Hh signaling pathway, expression of SHh in the floor plate 
of Xenopus embryos injected with hfused-UH could be rescued by coinjection of G//-I. This suggests that 
fused acts in association with Gli in the SHh signaling pathway. 

The tissue distribution of fused shows that it is expressed in all SHh responsive cells. In particular, 
1 5 its expression pattern overlaps well with Ptch, the binding component of the Hh receptor which is itself a 
target gene of the SHh signaling pathway. These data suggest that fused is involved in mediating a wide 
variety of effect SHh has on different tissues. Functionally, this was observed again in frog embryos where, 
fused-DN inhibited eye development as well as SHh expression in the floor plate. 

bF:tsed-DN also appears to affect normal development of tissues such as the frog gut which is 
20 regulated by Indian Hh. This, combined with the fact that fused is expressed in the gut and testis, sites of IHh 
and DHh action respectively, suggest that fused may be a universal mediator of signaling for all members of 
the Hh protein family. 

Very high levels of fused mRNA was found on germ cell, the development of which appears to be 
regulated by DHh. Homozygous mutant mice for DHh fail to develop germ cells and are viable but sterile 
25 (Bitgood et al.. Curr. Biol. 6: 298-304 (1996). However. Patched, a Hedgehog receptor is expressed on 
interstitial Leydig cells and not on germ cells where fused is expressed, Bitgood et al. supra. This 
discrepancy suggests that there may be additional hedgehog receptors. 

Applicants have shown in the Examples that wild type hfused is capable of activating Gli in a 
reporter assay. Furthermore, expression of SHh in the floor plate of frog embryos injected with hfused-DH 
30 could be rescued by coinjection of Gli- 1 . Taken together these observations are consistent with the assertion 
that fused acts downstream of Smo and upstream of Gli in this signaling pathway, which is consistent with 
the genetic evidence in Drosophila to date. 

(2) General uses for vertebrate fused 

Nucleotide sequences (or their complement) encoding vertebrate fused have various 
35 applications in the art of molecular biology, including uses as hybridization probes, in chromosome and gene 
mapping and in the generation of anti-sense RN A and DNA. Vertebrate/use^ nucleic acid will also be useful 
for the preparation of vertebrate fused polypeptides by the recombinant techniques described herein. 

The full-length native sequence vertebrate fused gene, or portions thereof, may be used as 
hybridization probes for a cDNA library to isolate the full-length gene or to isolate still other genes (for 
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instance, those encoding naturally-occurring variants of vertebrate fused or vertebrate fused from other 
species) which have a desired sequence identity to the vertebrate fused sequence disclosed in Fig. I (SEQ ID 
NO 1). Optionally, the length of the probes will be about 20 to about 50 bases. The hybridization probes 
may be derived from the nucleotide sequence of Fig. 1 (SEQ ID NO I) or from genomic sequences including 
5 promoters, enhancer elements and introns of native sequence vertebrate fused. By way of example, a 
screening method will comprise isolating the coding region of the vertebrate fused gene using the known 
DNA sequence to synthesize a selected probe of about 40 bases. Hybridization probes may be labeled by a 
variety of labels, including radionucleotides such as 32 P or 35 S, or enzymatic labels such as alkaline 
phosphatase coupled to the probe via avidin/biotin coupling systems. Labeled probes having a sequence 

1 0 complementary to that of the vertebrate fused gene of the present invention can be used to screen libraries of 
human cDNA, genomic DNA or mRNA to determine to which members of such libraries the probe 
hybridizes. Hybridization techniques are described in further detail in the Examples below. 

The probes may also be employed in PCR techniques to generate a pool of sequences for 
identification of closely related vertebrate fused sequences. 

1 5 Nucleotide sequences encoding a vertebrate fused can also be used to construct hybridization probes 

for mapping the gene, which encodes vertebrate fused and for the genetic analysis of individuals with genetic 
disorders. The nucleotide sequences provided herein may be mapped to a chromosome and specific regions 
of a chromosome using known techniques, sucn as in situ hybridization, linkage analysis against known 
chromosomal markers, and hybridization screening with libraries. 

20 Vertebrate fused polypeptides can be used in assays to identify the other proteins or molecules 

involved in complexing with fused which ultimately results in the modulation of hedgehog signaling. 
Alternatively, these molecules can modulate the fused kinase phosphorylation of its substrate. By such 
methods, inhibitors of the binding interaction car. be identified. Proteins involved in such binding 
interactions can also be used to screen for peptide or small molecule inhibitors or agonists of the binding 

25 interaction. Also, the substrate of vertebrate fused can be used to isolate correlative complexing proteins. 
Screening assays can be designed to find lead compounds that mimic the biological activity of a native 
vertebrate fused or to find those that act as a substrate for vertebrate fused. Such screening assays will 
include assays amenable to high-throughput screening of chemical libraries, making them particularly 
suitable for identifying small molecule drug candidates. Such small molecule inhibitors could block the 

30 enzymatic action of fused, and thereby inhibit hedgehog signaling. Small molecules contemplated include 
synthetic organic or inorganic compounds. The assays can be performed in a variety of formats, including 
protein-protein binding assays, biochemical screening assays, immunoassays and cell based assays, which are 
well characterized in the art. 

Nucleic acids which encode vertebrate fused or its modified forms can also be used to generate 

35 either transgenic animals or "knock out" animals which, in turn, are useful in the development and screening 
of therapeutically useful reagents. A transgenic animal (e.g., a mouse or rat) is an animal having cells that 
contain a transsene. which transgene was introduced into the animal or an ancestor of the animal at a 
prenatal, e.g.. an embryonic stage. A transgene is a DNA that is integrated into the genome of a cell from 
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which a transeenic animal develops. In one embodiment. cDNA encoding vertebrate fused can be used to 
clone eenomic DNA encoding vertebrate fused in accordance with established techniques and the genomic 
sequences used to generate transgenic animals that contain cells which express DNA encoding vertebrate 
fused. Methods for generating transgenic animals, particularly animals such as mice or rats, have become 
5 conventional in the art and are described, for example, in U.S. Patent Nos. 4,736.866 and 4,870,009. 
Typically, particular cells would be targeted for vertebrate fused transgene incorporation with tissue-specific 
enhancers. Transgenic animals that include a copy of a transgene encoding vertebrate fused introduced into 
the germ line of the animal at an embryonic stage can be used to examine the effect of increased expression 
of DNA encoding vertebrate fused. Such animals can be used as tester animals for reagents thought to confer 

10 protection from, for example, pathological conditions associated with its overexpression. For example, for 
basal cell carcinoma, fused can be overexpressed in the basal cell layer of the skin using a Keratin 5 or 14 
promoter. In accordance with this facet of the invention, an animal is treated with the reagent and a reduced 
incidence of the pathological condition, compared to untreated animals bearing the transgene, would indicate 
a potential therapeutic intervention for the pathological condition. 

1 5 Non-human homologues of vertebrate fused can be used to construct a vertebrate fused "knock out" 

animal which has a defective or altered gene encoding vertebrate fused as a result of homologous 
recombination between the endogenous gene encoding vertebrate fused and altered genomic DNA encoding 
vertebrate fused introduced into an embryonic cell of the animal. For example, cDNA encoding vertebrate 
fused can be used to clone genomic DNA encoding vertebrate fused in accordance with established 

20 techniques. A portion of the genomic DNA encoding vertebrate fused can be deleted or replaced with 
another gene, such as a gene encoding a selectable marker that can be used to monitor integration. Typically, 
several kilobases of unaltered flanking DNA (both at the 5' and 3' ends) are included in the vector [see e.g., 
Thomas and Capecchi, Cel], 51:503 (1987) for a description of homologous recombination vectors]. The 
vector is introduced into an embryonic stem cell line (e.g., by electroporation) and cells in which the 

25 introduced DNA has homologously recombined with the endogenous DNA are selected [see e.g., Li el ai. 
Cell, 69:915 (1992)]. The selected cells are then injected into a blastocyst of an animal (e.g., a mouse or rat) 
to form aggregation chimeras [see e.g., Bradley, in Teratocarcinomas and Embryonic Stem Cells: A 
Practical Approach, E. J. Robertson, ed. (1RL, Oxford, 1987), pp. 1 13-152]. A chimeric embryo can then be 
implanted into a suitable pseudopregnant female foster animal and the embryo brought to term to create a 

30 "knock out" animal. Progeny harboring the homologously recombined DNA in their germ cells can be 
identified by standard techniques and used to breed animals in which all cells of the animal contain the 
homologously recombined DNA. Ktiockout animals can be characterized for instance, for their ability to 
defend against certain pathological conditions and for their development of pathological conditions due to 
absence of the vertebrate fused polypeptide. 

35 As fused has been implicated as a universal mediator for all members of the Hh family (SHh, IHh. 

DHh), disease states or disorders which are associated with general Hh signaling, would also be treatable 
with fused and antagonists and agonists thereof. For example, SHh activation (e.g. fused agonists) has 
recently been promoted as a treatment for various degenerative disorders of the nervous system, e.g., 
Parkinson's disease, memory deficits, Alzheimer's disease, Lou Gehrig's disease, Huntington's disease, 
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schizophrenia, stroke and drug addiction. Recent studies suggest that Dhh mutant males are infertile due to 
the failure of spermatocytes to complete their differentiation into mature sperm. Bitgood et al.. Curr. Biol. 6: 
298-304 (1996); Bitgood et al.. Dev. Biol. \72: 126-138 (1995). Additionally, fused agonists could be used 
to great gut diseases, bone diseases, skin diseases, diseases of the testis, ulcers, lung diseases, diseases of the 
5 pancreas, diabetes, osteoporosis. 

The presence of the protein kinase domain suggests that fused may act similarly as members of the 
protein kinase family in the modulation of Hh signaling. Protein kinases are essential elements of regulatory 
circuits in differentiated as well as growing cells; Preat et al.. Nature 347: 87-89 (1990). Many of these 
enzyme are involved in transduction of extracellular signals and operate through a cascade of 

10 phosphorylation events that amplify' and disseminate the effects of a primary signal. As described earlier, 
Drosophila fused bears significant homology to other intracellular serine/threonine kinases. Many 
serine/threonine kinases are implicated in cell-cycle control in yeasts and in mammals, Hunter, Cell 50: 823- 
829 (1987); Dunphy & Newport, Cell 55: 925-928 ( 1988); Lee & Nurse, Trend. Genet. 4: 287-290 (1988). 

Suppression or inhibition of Hh signaling is also an objective of therapeutic strategies. Since 

15 inactive fused has been shown to inhibit Hh signaling, it follows that a fused antagonist would also be 
expected to be antagonistic to Hh signaling. Limiting Hh signaling would be useful in disease states or 
disorders characterized by Hh signaling. For example, SHh is known to be active in Basal Cell Carcinoma; 
DHh is known to be active in spermatogenesis. Inhibitor or antagonist of Hh signaling would be effective 
therapeutics in the treatment of Basai Cell Carcinoma or male contraception, respectively. 

20 The stimulation of Hh signaling is also an objective of therapeutic strategies. Activating Hh 

signaling would be useful in disease states or disorders characterized by inactive or insufficient Hh signaling. 
For example, degenerative disorders of the nervous system, e.g., Parkinson's disease, memory deficits, 
Alzheimer's disease, Lou Gehrig's disease, Huntington's disease, schizophrenia, stroke and drug addiction. 
Additional ly,/i/W agonists could be used to great gut diseases, bone diseases, skin diseases, diseases of the 

25 testis (including infertility), ulcers, lung diseases, diseases of the pancreas, diabetes, osteoporosis. 
F. Anti-vertebrate fused Antibodies 

The present invention further provides anti- vertebrate fused antibodies. Exemplary antibodies 
include polyclonal, monoclonal, humanized, bispecific, and heteroconjugate antibodies. 
1. Polyclonal Antibodies 
30 The anti-vertebrate fused antibodies may comprise polyclonal antibodies. Methods of 

preparing polyclonal antibodies are known to the skilled artisan. Polyclonal antibodies can be raised in a 
mammal, for example, by one or more injections of an immunizing agent and, if desired, an adjuvant. 
Typically, the immunizing agent and/or adjuvant will be injected in the mammal by multiple subcutaneous or 
intraperitoneal injections. The immunizing agent may include the vertebrate fused polypeptide or a fusion 
35 protein thereof. It may be useful to conjugate the immunizing agent to a protein known to be immunogenic 
"in "the mammal being immunized. Examples of such immunogenic proteins include but are not limited to 
keyhole limpet hemocyanin. serum albumin, bovine thyroglobulin. and soybean trypsin inhibitor. Examples 
of adjuvants that may be employed include Freund's complete adjuvant and MPL-TDM adjuvant 
(monophosphoryl Lipid A. synthetic trehalose dicorynomycolate). The immunization protocol may be 
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selected by one skilled in the art without undue experimentation. 
2. Monoclonal Antibodies 

The anti-vertebrate fused antibodies may, alternatively, be monoclonal antibodies. 
Monoclonal antibodies may be prepared using hybridoma methods, such as those described by Kohler and 
Milstein, Nature . 256:495 (1975). In a hybridoma method, a mouse, hamster, or other appropriate host 
animal, is typically immunized with an immunizing agent to elicit lymphocytes that produce or are capable 
of producing antibodies that will specifically bind to the immunizing agent. Alternatively, the lymphocytes 
may be immunized in vitro. 

The immunizing agent will typically include the vertebrate fused polypeptide or a fusion protein 
thereof. Generally, either peripheral blood lymphocytes ("PBLs") are used if cells of human origin are 
desired, or spleen cells or lymph node cells are used if non-human mammalian sources are desired. The 
lymphocytes are then fused with an immortalized cell line using a suitable fusing agent, such as polyethylene 
glycol, to form a hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice . Academic Press, 
(1986) pp. 59-103]. Immortalized cell lines are usually transformed mammalian cells, particularly myeloma 
cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are employed. The 
hybridoma cells may be cultured in a suitable culture medium that preferably contains one or more 
substances that inhibit the growth or survival of the unfused, immortalized cells. For example, if the parental 
cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture 
medium for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT 
medium"), which substances prevent the growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression 
of antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. 
More preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from 
the Salk Institute Cell Distribution Center, San Diego, California and the American Type Culture Collection, 
Rockville, Maryland. Human myeloma and mouse-human heteromyeloma cell lines also have been 
described for the production of human monoclonal antibodies [Kozbor. J. Immunol. . 133:3001 (1984); 
Brodeur et al., Monoclonal Antibody Production Techniques and Applications . Marcel Dekker, Inc., New 
York, (1987) pp. 51-63]. 

The culture medium in which the hybridoma cells are cultured can then be assayed for the presence 
of monoclonal antibodies directed against vertebrate fused. Preferably, the binding specificity of monoclonal 
antibodies produced by the hybridoma cells is determined by immunoprecipitation or by an in vitro binding 
assay, such as radioimmunoassay (R1A) or enzyme-linked immunoabsorbent assay (ELISA). Such 
techniques and assays are known in the art. The binding affinity of the monoclonal antibody can. for 
example, be determined by the Scatchard analysis of Munson and Pollard, Anal. Biochem. . 107:220 (1980). 

After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution 
procedures and grown by standard methods [Goding, supra l- Suifable culture media for this purpose include, 
for example, Duibecco's Modified Eagle's Medium and RPM1-1640 medium. Alternatively, the hybridoma 
cells may be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture 
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medium or ascites fluid by conventional immunoglobulin purification procedures such as, for example.' 
protein A-Sepharose. hydroxy lapatite chromatography, gel electrophoresis, dialysis, or affinity 
chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, such as those 
5 described in U.S. Patent No. 4.816.567. DNA encoding the monoclonal antibodies of the invention can be 
readily isolated and sequenced using conventional procedures (e.g.. by using oligonucleotide probes that are 
capable of binding specifically to genes encoding the heavy and light chains of murine antibodies). The 
hybridoma cells of the invention serve as a preferred source of such DNA. Once isolated, the DNA may be 
placed into expression vectors, which are then transfected into host ceils such as simian COS cells, Chinese 
10 hamster ovary (CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to 
obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA also may be 
modified, for example, by substituting the coding sequence for human heavy and light chain constant 
domains in place of the homologous murine sequences [U.S. Patent No. 4.816,567; Morrison et ai., supra] or 
by covalently joining to the immunoglobulin coding sequence all or part of the coding sequence for a non- 
15 immunoglobulin polypeptide. Such a non-immunoglobulin polypeptide can be substituted for the constant 
domains of an antibody of the invention, or can be substituted for the variable domains of one antigen- 
combining site of an antibody of the invention to create a chimeric bivalent antibody. 

The antibodies may be monovalent antibodies. Methods fcr preparing monovalent antibodies are 
well known in the art. F or example, one method involves recombinant expression of immunoglobulin light 
20 chain and modified heavy chain. The heavy chain is truncated generally at any point in the Fc region so as to 
prevent heavy chain crosslmking. Alternatively, the relevant cysteine residues are substituted with another 
amino acid residue or are deleted so as to prevent crosslinking. 

In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to 
produce fragments thereof, particularly. Fab fragments, can be accomplished using routine techniques known 
25 in the art. 

3. Humanized Antibodies 

The anti-vertebrate fused antibodies of the invention may further comprise humanized 
antibodies or human antibodies. Humanized forms of non-human (e.g., murine) antibodies are chimeric 
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab. Fab', F(ab') 2 or other 

30 antigen-binding subsequences of antibodies) which contain minimal sequence derived from non-human 
immunoglobulin. Humanized antibodies include human immunoglobulins (recipient antibody) in which 
residues from a complementary determining region (CDR) of the recipient are replaced by residues from a 
CDR of a non-human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, 
affinity and capacity. In some instances. Fv framework residues of the human immunoglobulin are replaced 

35 by corresponding non-human residues. Humanized antibodies may also comprise residues that are found 
neither in the recipient antibody nor in the imported CDR or framework sequences. In general, the 
humanized antibody will comprise substantially all of at least one, and typically two. variable domains, in 
which all or substantially "all of the CDR regions correspond to those of a non-human immunoglobulin and 
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all or substantially all of the FR regions are those of a human immunoglobulin consensus sequence. The 
humanized antibody optimally also will comprise at least a portion of an immunoglobulin constant region 
(Fc), typically that of a human immunoglobulin [Jones et al., Nature , 321 :522-525 (1986); Riechmann et al., 
Nature . 332:323-329 (1988): and Presta, Curr. Op. Struct. Biol. . 2:593-596 (1992)1. 
5 Methods for humanizing non-human antibodies are well known in the an. Generally, a humanized 

antibody has one or more amino acid residues introduced into it from a source that is non-human. These 
non-human amino acid residues are often referred to as "import" residues, which are typically taken from an 
"import" variable domain. Humanization can be essentially performed following the method of Winter and 
co-workers [Jones et al.. Nature . 321:522-525 (1986): Riechmann et al.. Nature . 332:323-327 (1988); 

10 Verhoeyen et al.. Science , 239:1534-1536 (1988)]. by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric 
antibodies (U.S. Patent No. 4.816,567), wherein substantially less than an intact human variable domain has 
been substituted by the corresponding sequence from a non-human species. In practice, humanized 
antibodies are typically human antibodies in which some CDR residues and possibly some FR residues are 

1 5 substituted by residues from analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the art, including phage 
display libraries [Hoogenboom and Winter, J. Mol. Biol. . 227:381 (1991); Marks et al., J. Mol. Biol. . 
222:581 (1991)]. The techniques of Cole et al. and Boerner et al. are also available for the preparatio p of 
human monoclona; antibodies (Cole et al., Monoclonal Antibodies and Cancer Therapy . Alan R. Liss, p. /'7 

20 (1985) and Boerner et al.. J. Immunol. , 147(l) :86-95 (1991)]. 

4. Bispecific Antibodies 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have 
binding specificities for at least two different antigens. In the present case, one of the binding specificities is 
for the vertebrate fused, the other one is for any other antigen, and preferably for a cell-surface protein or 

25 receptor or receptor subunit. 

Methods for making bispecific antibodies are known in the an. Traditionally, the recombinant 
production of bispecific antibodies is based on the co-expression of two immunoglobulin heavy-chain/light- 
chain pairs, where the two heavy chains have different specificities [Milstein and Cuello, Nature . 305 :537- 
539 (1983)]. Because of the random assortment of immunoglobulin heavy and light chains, these 

30 hybridomas (quadromas) produce a potential mixture of ten different antibody molecules, of which only one 
has the correct bispecific structure. The purification of the correct molecule is usually accomplished by 
affinity chromatography steps. Similar procedures are disclosed in WO 93/08829. published 13 May 1993, 
and in Traunecker et al.. EMBO J. . 10:3655-3659 (1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen combining sites) 

35 can be fused to immunoglobulin constant domain sequences. The fusion preferably is with an 
immunoglobulin heavy-chain constant domain, comprising at least part of the hinge. CH2;and CHS regions. 
It is preferred to have the first heavy-chain constant region (CHI) containing the site necessary for light- 
chain binding present in at least one of the fusions. DNAs encoding the immunoglobulin heavy-chain 
fusions and. if desired, the immunoglobulin light chain, are insened into separate expression vectors, and are 
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co-transfected inio a suitable host organism. For further details of generating bispecific antibodies see. for 
example. Suresh et al., Methods in Enzvmologv . 121 :2 10(1 986). 
5. Heteroconiuaate Antibodies 

Heteroconiugate antibodies are also within the scope of the present invention. 
5 Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies have, for 
example, been proposed to target immune system cells to unwanted cells [U.S. Patent No. 4,676.980], and 
for treatment of HIV infection [WO 91/00360; WO 92/200373; EP 03089]. It is contemplated that the 
antibodies may be prepared in vitro using known methods in synthetic protein chemistry, including those 
involving crosslinking agents. For example, immunotoxins may be constructed using a disulfide exchange 
10 reaction or by forming a thioether bond. Examples of suitable reagents for this purpose include iminothiolate 
and methyl-4-mercaptoburyrimidate and those disclosed, for example, in U.S. Patent No. 4.676,980. 

G. Uses for ant i- vertebrate fused Antibodies 

The anti-vertebrate fused antibodies of the invention have various utilities. For example, anti- 
vertebrate fused antibodies may be used in diagnostic assays for vertebrate fused, e.g., detecting its 
15 expression in specific cells, tissues, or serum. Various diagnostic assay techniques known in the an may be 
used, such as competitive binding assays, direct or indirect sandwich assays and immunoprecipitation assays 
conducted in either heterogeneous or homogeneous phases [Zola, Monoclonal Antibod ies: A Manual of 
Technique -,. CRC Press, Inc. (1987) pp. 147-158]. The antibodies used in the diagnostic assays can be 
labeled with a detectable moiety. The detectable moiety should be capable of producing, either directly or 

3 14 

20 indirectly, a detectable signal. For example, the detectable moiety may be a radioisotope, such as H, C, 
32 P, 35 S, or 125 I, a fluorescent or chemiluminescent compound, such as fluorescein isothiocyanate, 
rhodamine, or luciferin. or an enzyme, such as alkaline phosphatase, beta-galactosidase or horseradish 
peroxidase. Any method known in the art for conjugating the antibody to the detectable moiety may be 
employed, including those methods described by Hunter et al.. Nature , .144:945 (1962); David et al.. 

25 Biochemistry , 13:1014 (1 974): Pain et a!., J. Immunol. Meth., 40:2 1 9 ( 1 98 1 ); and Nygren, J. Hisjochem. and 
Cvtochem. , 30:407 (1982). 

Anti-vertebrate fused antibodies also are useful for the affinity purification of vertebrate fused from 
recombinant cell culture or natural sources. In this process, the antibodies against vertebrate fused are 
immobilized on a suitable support, such a Sephadex resin or filter paper, using methods well known in the 

30 art. The immobilized antibody then is contacted with a sample containing the vertebrate fused to be purified, 
and thereafter the support is washed with a suitable solvent that will remove substantially all the material in 
the sample except the vertebrate fused, which is bound to the immobilized antibody. Finally, the support is 
washed with another suitable solvent that will release the vertebrate fused from the antibody. 

H. Fused Antagonists 

35 Several approaches may be suitably employed to create the fused antagonist and agonist compounds 

of the present invention. Any approach where the antagonist molecule can be targeted to the interior of the 
cell, which interferes or prevents wild type fused from normal operation is suitable. For example, 
competitive inhibitors, including mutant fused such as dominant negative mutant identified in the Examples. 
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which prevent fused from properly binding with other proteins necessary for Hh signaling. Additional 
properties of such antagonist or agonist molecules are readily determinable by one of ordinary skill, such as 
size, charse and hydrophobicity suitable for transmembrane transport. 

Where mimics or other mammalian homologues of fused are to be identified or evaluated, the cells 
5 are exposed to the test compound and compared to positive controls which are exposed only to human fused. 
and to negative controls which were not exposed to either the compound or the natural ligand. Where 
antagonists or agonists of fused signal modulation are to be identified or evaluated, the cells are exposed to 
the compound of the invention in the presence of the natural ligand and compared to controls which are not 
exposed to the test compound. 
10 Detection assays may by employed as a primary screen to evaluate the phosphatase 

inhibition/enhancing activity of the antagonist/agonist compounds of the invention. The assays may also be 
used to assess the relative potency of a compound by testing a range of concentrations, in a range from 100 
mM to 1 pM, for example, and computing the concentration at which the amount of phosphorylation or 

signal transduction is reduced or increased by 50% (IC^q) compared to controls. 

15 Assays can be performed to identify compounds that affect phosphorylation of fused substrates. 

Specifically, assays can be performed to identify compounds that increase the phosphorylation activity of 
fused or assays can be performed to identify compounds that decrease the phosphorylation of fused 
substrates. These assays can be performed either on whole cells themselves or on cell extracs. The assays 
can be performed in a variety of formats, including protein-protein binding assays, biochemical screening 

20 assays, immunoassays, cell based assays, etc. Such assay formats are well known in the art. 

The screening assays of the present invention are amenable to high-throughput screening of 
chemical libraries, and are particularly suitable for identifying small moiecule drug candidates. 
(' '} Antagonist and agonist molecules 

To screen for antagonists and/or agonists of fused signaling, the assay mixture is incubated under 
25 conditions whereby, but for the presence of the candidate pharmacological agent, fused induces hedgehog 
signaling with a reference activity. The mixture components can be added in any order that provides for the 
requisite hedgehog activity. Incubation may be performed at any temperature that facilitates optimal 
binding, typically between about 4° and 40°C, more commonly between about 15° and 40°C. Incubation 
periods are likewise selected for optimal binding but also minimized to facilitate rapid, high-throughput 
30 screening, and are typically between about 0.1 and 10 hours, preferably less than 5 hours, more preferably 
less than 2 hours. After incubation, the effect of the candidate pharmacological agent on the fused signaling 
is determined in any convenient way. For cell-free binding-type assays, a separation step is often used to 
separate bound and unbound components. Separation may, for example, be effected by precipitation (e.g. 
TCA precipitation, immunoprecipitation, etc.), immobilization (e.g. on a solid substrate), followed by 
35 washing. The bound protein is conveniently detected by taking advantage of a detectable label attached to it, 
e.g. by. measuring radioactive emission, optical or electron density, or by indirect detection using, e.g. 
antibody conjugates. 
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For example, a method of screening for suitable fused antagonists and/or agonists could involve the 
application of agents present in the fused activating Gli reporter assay described in the Examples. Such a 
screening assay could compare in situ hybridization in the presence and absence of the candidate antagonist 
and/or agonist 

5 in a fused expressing tissue as well as confirmation or absence of fused modulated cellular development. 
Typically these methods involve exposing an immobilized fused to a molecule suspected of binding thereto 
and determining binding or phosphorylation of the molecule to the immobilized fused and/or evaluating 
whether or not the molecule activates (or blocks activation of) fused. In order to identify such fused binding 
ligands, fused can be expressed on the surface of a cell and used to screen libraries of synthetic candidate 
1 0 compounds or naturally-occurring compounds (e.g., from endogenous sources such as serum or cells). 

Suitable molecules that affect the protein-protein interaction of fused and its binding proteins 
include fragments of the latter or small molecules, e.g., peptidomimetics, which will prevent interaction and 
proper complex formation. Such small molecules, which are usually less than 10 K molecular weight, are 
preferable as therapeutics since they are more likely to be permeable to cells, are less susceptible to 
15 degradation by various cellular mechanisms, and are not as apt to elicit an immune response as proteins. 
Small molecules include but are not limited to synthetic organic or inorganic compounds. Many 
pharmaceutical companies have extensive libraries of such molecules, which can be conveniently screened 
by using the assays of the present invention. Non-limiting examples includ" proteins, peptides, 
glycoproteins, glycopeptides, glycolipids, polysaccharides, oligosacchardies, nucleic acids, bioorganic 
20 molecules, peptidomimetics, pharmacological agents and their metabolites, transcriptional and translation 
control sequences, and the like. 

A preferred technique for identifying molecules which bind to fused utilizes a chimeric substrate 
(e.g., epitope-tagged fused or fused immunoadhesin) attached to a solid phase, such as the welt of an assay- 
plate. The binding of the candidate molecules, which are optionally labeled (e.g., radiolabeled), to the 
25 immobilized receptor can be measured. Alternatively, competition for activation of Gli can be measured. In 
screening for antagonists and/or agonists, fused can be exposed to a fused substrate followed by the putative 
antagonist and/or agonist, or the fused binding protein and antagonist and/or agonist can be added 
simultaneously, and the ability of the antagonist and/or agonist to block fused activation can be evaluated. 

(2) Detection assays 

30 The fused polypeptides are useful in assays for identifying lead compounds for therapeutically 

active agents that modulate fused hedgehog signaling. Specifically, lead compounds that either prevent the 
formation of fused signaling complexes or prevent or attenuate fused modulated hedgehog signaling (eg, 
binding to fused itself or to a substrate) can be conveniently identified. 

Various procedures known in the art may be used for identifying, evaluating or assaying the 

35 inhibition of activity of the fused proteins of the invention. As fused is believed to operate in a similar 
manner as other kinases, techniques known for use" with identifying kinase/phosphatase modulators may also 
be employed with the present invention. In general, such assays involve exposing target cells in culture to 
the compounds and a) biochemically analyzing cell lysates to assess the level and/or identity of 
phosphorylation: or (b) scoring phenorypic or functional changes in treated cells as compared to control cells 
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that were not exposed to the test substance. Such screening assays are described in U.S.P. 5.602 17 1. U.S.P. 
5,710,173. WO 96/35 124 and WO 96/40276. 

(a) Biochemical detection techniques 

Biochemical analysis techniques can be evaluated by a variety of techniques. One typical 
assay mixture which can be used with the present invention contains fused and a protein with which fused is 
normally associated (e.g. GO), usually in an isolated, partially pure or pure form. One or both of these 
components may be fused to another peptide or polypeptide, which may. for example, provide or enhance 
protein-protein binding, improve stability under assay conditions, etc. In addition, one of the components 
usually comprises or is coupled to a detectable label. The label may provide for direct detection by 
measuring radioactivity, luminescence, optical or electron density, etc.. or indirect detection such as an 
epitope tag. an enzyme, etc. The assay mixture can additionally comprise a candidate pharmacological 
agent, and optionally a variety of other components, such as salts, buffers, carrier proteins, e.g. albumin, 
detergents, protease inhibitors, nuclease inhibitors, antimicrobial agents, etc., which facilitate binding, 
increase stability, reduce non-specific or background interactions, or otherwise improve the efficiency or 
sensitivity of the assay. 

The following detection methods may also be used in a cell-free system wherein cell lysate 
containing the signal transducing substrate molecule and fused is mixed with a compound of the invention. 
The substrate is phosphorylated by initiating the kinase reaction by the additior. of adenosine triphosphate 
(ATP). To assess the activity of the compound, the reaction mixture may be analyzed by the SDS-PAGE 
technique or it may be added to substrate-specific anchoring antibody bound to a solid support, and a 
detection procedure as described above is performed on the separated or captured substrate to assess the 
presence or absence of pSer/Thr. The results are compared to those obtained with reaction mixtures to which 
the compound is not added. The cell-free system does not required the natural ligand or knowledge of its 
identity. The cell-free system does not require mixtures to which the compound is not added. The cell-free 
system does not require the natural ligand or knowledge of its identity. For example. Posner et at. (U.S.P. 
5,155,031 describes the use of insulin receptor as a substrate and rat adipocytes as target cells to demonstrate 
the ability of pervanadate to inhibit PTP activity. Another example, Burke et a/.. Biochem. Biophys. Res. 
Comm. 204: 129-134 (1994) describes the use of autophosphorylated insulin receptor and recombinant 
PTP IB in assessing the inhibitory activity of a phophotyrosyl mimetic. 

(i) Whole cell detection 

A common technique involves incubating cells with vertebrate fused and 
radiolabeled phosphate, lysing the cells, separating cellular protein components of the lysate using an SDS- 
polyacrylamide gel (SDS-PAGE) technique, in either one or two dimensions, and detecting the presence of 
phosphorylated proteins by exposing X-ray film. Detection can also be effected without using radioactive 
labeling. In such a technique, the protein components (e.g., separated by SDS-PAGE) are transferred to a 
nitrocellulose membrane where the presence' of ' phospnorylated serihe/threonmes is detected using an 
antiphosphoserine/threonine antibody (anti-pS/T). 

Alternatively, the anti-pS/T can be conjugated with an enzyme, such as horseradish peroxidase, and 
detected by subsequent addition of a colorimetric substrate for the enzyme. A further alternative involves 
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detecting the anti-PSfT by reacting with a second antibody that recognizes the anti-PS/T. this second 
antibody being labeled with either a radioactive moiety or an enzyme as prev.ously described. Examples of 
these and similar techniques are described in Hansen et aL Electrophoresis 14: 1 12-126 (1993); Campbell et 
aL J. Biol Chem. 268; 7427-7434 ( 1 993); Donato et aL Cell Growth Diff 3: 258-268 ( 1 992); Katagiri et aL. 
5 1 Immunol. 150: 585-593 (1993). Additionally, the anti- P S/T can be detected by labeling it with a 
radioactive substance, followed by scanning the labeled nitrocellulose to detect radioactivity or exposure of 
X-ray film. 

(ii) Kinase assays 

When the screening methods of the present invention for fused 
1 0 antagonists/agonists are carried out as an ex vivo assay, the target kinase (e.g. fused) can be a substantially 
purified polypeptide. The kinase substrate (e.g., MBP, Gli) is a substantially purified substrate, which in the 
assay is phosphoryiated in a reaction with a substantially purified phosphate source that is catalyzed by the 
kinase. The extent of phosphorylation is determined by measuring the amount of substrate phosphoryiated 
in the reaction. A variety of possible substrates may be used, including the kinase itself in which instance the 
1 5 phosphorylation reaction measured in the assay is autophosphorylation. Exogenous substrates may also be 
used, including standard protein substrates such as myelin basic protein (MBP); yeast protem substrates; 
synthetic peptide substrates, and polymer substrates. Of these, MBP and other standard protein substrates 
may be regarded as preferred (see Example 10). Other substrates may be .dentified. however, which are 
superior by way of affinity for the kinase, minimal perturbation of reaction kinetics, possession of single or 
20 homogenous reaction sites, ease of handling and post-reaction recover, potential for strong signal generation, 
and resistance or inertness to test compounds. 

Measurement of the amount of substrate phosphoryiated in the ex wo assay of the invention may be 
carried out by means of immunoassay, radioassay or other well-known methods. In an immunoassay 
measurement, an antibody (such as a goat or mouse anti-phosphoserine/threonine antibody) may be used 
25 which is specific for phosphoryiated moieties formed during the react.cn. Using well-known ELISA 
techniques, the phosphoserine/threonine antibody complex would itself be detected by a further antibody 
linked to a label capable of developing a measurable signal (as for example a fluorescent or radioactive 
label). Additionally, ELISA-type assays in microtitre plates may be used to test purified substrates. Peraidi 
et al.. J. Biochem. 285: 71-78 (1992); Schraag et aL Anal. Biochem. 2U: 233-239 (1993); Cleavland, Anal. 
30 Biochem. 190: 249-253 (1990); Farley, Ana!. Biochem. 203: 151-157 (1992) and Lozaro, Anal. Biochem. 
192:257-261 (1991). 

For example, detection schemes can measure substrate depletion during the kinase reaction. 
Initially, the phosphate source may be radiolabeled with an isotope such as 32 P or "P. and the amount of 
substrate phosphorylation may be measured by determining the amount of radiolabel incorporated into the 
35 substrate during the reaction. Detection may be accomplished by: (a) commercially available scintillant- 
containing plates and beads using a beta-counter, after adsorption to a filter or a microtitre well surface, or 
(b) photometric means after binding to a scintillation prox.mity assay bead or scintillant plate. Weemink and 
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Kijken. J. Biochem. Biophs. Methods 3V. 49. 1996: Braunwalder ei a/.. Anal. Biochem. 234: 23 (1996): 
Kentrup et ai, J. Biol. Chem. 2J±: 3488 (1996) and Rusken el a!. Meth. Enzymo! 200: 98 (1991). 

Preferably, the substrate is attached to a solid support surface by means of non-specific or. 
preferably, specific binding. Such attachment permits separation of the phosphoryiated substrate from 
unincorporated, labeled phosphate source (such as adenosine triphosphate prior to signal detection. In one 
embodiment, the substrate may be physically immobilized prior to reaction, as through the use of Nunc™ 
high protein binding plate (Hanke et ai. J. Biol Chem. 21±: 695 (1996)) or Wallac Sc.ntiStrip™ plates 
(Braunwalder et a/.. Anal. Biochem. 234: 23 (1996). Substrate may also be immobilized after reaction by 
capture on, for example, P81 phophocellulose (for basic peptides), PEi/acidic moiybdate resin or DEAE, or 
TCA precipitation onto Whatman™ 3MM paper. Tiganis et a!. Arch. Biochem. Biophys. 325: 289 (1996); 
Morawetz ei ai, Mol. Gen. Genet. 250; 17 (1996); Budde et al, Int J. Pharmacognosy 33. 27 (1995) and 
Casnellie, Meth. Enz. 200: 115 (1991). Yet another possibility is the attachment of the substrate to the 
support surface, as by conjugation with binding partners such as glutathione and streptavidin (in the case of 
GST and biotin), respectively) which have been attached to the support, or via antibodies specific for the tags 
which are likewise attached to the support. 

Further detection methods may be developed which are preferred to those described above. 
Especially for use in connection with high-throughput screening, it is expected that such methods would 
exhibit good sensitivity and specificity, extended linear range, low background signal, minimal fluctuation, 
compatibility with other reagents, and compatibility with automated handling systems. 

The in v/vo efficacy of the treatment of the present invention can be studied against chemically 
induced tumors in various rodent models. Tumor cell lines propagated in in vitro ceil cultures can be 
introduced in experimental rodents, e.g. mice by injection, for example by the subcutaneous route. 
Techniques for chemical inducement of rumors in experimental animals are well known in the art. 
(b) Biological detection techniques: 

The ability of the antagonist/agonist compounds of the invention to modulate the activity 
fused, which itself modulates hedgehog signaling, may also be measured by scoring for morphological or 
functional changes associated with ligand binding. Any qualitative or quantitative technique known in the 
art may be applied for observing and measuring cellular processes which comes under the control of fused. 
The activity of the compounds of the invention can also be assessed in animals using experimental models of 
disorders caused by or related to dysfunctional hedgehog signaling. For example, ineffective DHh 
hedgehog signalmg in mice leads to viable but sterile mice. The effects of mutant fused (hfused-VH) also 
affects gut development, which is regulated by IHh expression. Additionally, proper SHh signaling is critical 
to murine embryonic development at the notochord and floor plate, neural tube, distal limb structures, spinal 
column and ribs. Improper SHh signaling, is also correlative with cyclopia. Any of these phenotypic 
properties could be evaluated and quantified in a screening assay for fused antagonists and/or agonist. 
Disease states associated "with dverexpressibri of hedgehog is associated with" basal cell carcinoma while 
inactive sonic hedgehog signaling leads to improper neural development. 

The data obtained from these cell culture assays and animal studies can be used in formulating a 
range of dosages for use in humans. The dosage of the compounds of the invention should lie within a range 
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of circulating concentrations with little or no toxicity. The dosage may vary within this range depending on 
the dosage form employed and the route of administration. 
(2) Antisense nucleotides 

Another preferred class of antagonists involves the use of gene therapy techniques, include the 
admin.stration of ant.sense nucleotides. Applicable gene therapy techniques include single or multiple 
administrations of therapeutically effective DNA or mRNA. Antisense RNAs and DNAs can be used as 
therapeutic agents for blocking the expression of certain genes in vivo. Short antisense oligonucleotides can 
be .mported into cells where they act as inhibitors, despite their low intracellular concentrations caused by 
restricted uptake by the ceil membrane. Zamecnik e, al, Proc. Natl. Acad. Sci. USA 83: 4143-4146 (1986). 
The oligonucleotides can be modified to enhance their uptake, e.g., by substituting their negatively charged 
phosphodi ester groups by uncharged groups. 

There are a variety of techniques known for introducing nucleic acids into viable cells. The 
techniques vary depending upon whether the nucleic acid is transferred into cultured cells in vitro, ex vivo, or 
in vtvo in the cells of the intended host. Techniques suitable for the transfer of nucleic acid into mammalian 
cells in vitro include the use of liposomes, electroporation. microinjection, cell fusion. DEAE-dextran. the 
calcium phosphate precipitation method, etc. The currently preferred in vtvo gene transfer techniques 
mciude transfection with viral (typically retroviral) vectors and viral coat protem-liposome mediated 
transfection, Dzau et al.. Trends Biotech. U . 205-210 (1993). !n some situations it is desirable to provide the 
nucleic acid source with an agent that targets the targe, cells, s.ch as an antibody specific for a cell surface 
membrane protein associated with endocytosis may be used for targeting and/or to facilitate uptake, e.g.. 
capsid proteins or fragments thereof tropic for a particular cell type, antibod.es for prote.ns wh.ch undergo 
internalization in cycling, and proteins that target intracellular localization and enhance intracellular half-life. 
The technique of receptor-mediated endocytosis is described, for example, by Wu et al. J. Biol. Chem. 262: 
4429-4432 (1987); Wagner et al., Proc. Natl. Acad. Sci. USA 87: 34.0-3414 (1990). For a rev.ew of known 
sene marking and eene therapy protocols, see Anderson et a!.. Science 256: 808-8 1 3 (1992). 

in one embodiment, fused antagonist and/or agoms, molecules may be used to bind endogenous 
ligand in the cell, thereby causing the cell to be unresponsive xo fused wild type, especially when the levels of 
fused in the cell exceed normal physiological levels. Also, .t may be beneficial to bind endogenous fused 
substrates or complexing agents that are activating undesired cellular responses (such as proliferation of 
tumor cells). 

In a further embodiment of the invention, fused expression may be reduced by providing fused- 
expressing cells with an amount of fused antisense RNA or DNA effective to reduce expression of itefased 
protein. 

1. Diagnostic Uses 

Another use of the compounds of the invention (e.g.. human and vertebrate/,^, vertebrate fused 
variant and anti-vertebrate/^ antibodies) described herein is to help diagnose whether a disorder is driven, 
to some extent, >W or hedgehog signaling. For example, basal cell carcinoma cells are associated with 
active hedgehog signaling. 
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A diagnostic assay to determine whether a particular disorder is driven by hed°ehos signaling, can 
be carried out using the following steps: (I) culturing test cells or tissues; (2) administering a compound 
which can inhibit fused modulated hedgehog signaling; and (3) measuring the degree of kinase attenuation 
on the/used substrate in eel! lysates or hedgehog mediated phenotypic effects in the test cells. The steps can 
be carried out using standard techniques in light of the present disclosure. For example, standard techniques 
can be used to isolate cells or tissues and culturing or in vivo. 

Compounds of varying degree of selectivity are useful for diagnosing the role of fused. For 
example, compounds which inhibit fused in addition to another form of kinase can be used as an initial test 
compound to determine if one of several serine/threonine kinases drive the disorder. The selective 
compounds can then be used to further eliminate the possible role of the other serine/threonine kinases in 
driving the disorder. Test compounds should be more potent in inhibiting serine/threonine kinase activity 
than in exerting a cytotoxic effect (e.g., an IC 5O /LD 50 of greater than one). The IC 50 and LD 50 can be 
measured by standard techniques, such as an MTT assay, or by measuring the amount of LDH released. The 
degree of IC 50 /LD 50 of a compound should be taken into account in evaluating the diagnostic assay. 
Generally, the larger the ratio the more relative the information. Appropriate controls take into account the 
possible cytotoxic effect of a compound of a compound, such as treating cells not associated with a cell 
proliferative disorder (e.g., control cells) with a test compound, can also be used as pan of the diagnostic 
assay. The diagnostic methods of the invention invjlve the screening for agents that modulate the effects of 
fused upon hedgehog signaling. Exemplary detection techniques include radioactive labeling and 
immunoprecipitating (U.S.P. 5,385.915). 

The following examples are offered for illustrative purposes only, and are not intended to limit the 
scope of the present invention in any way. 

All patent and literature references cited in u;c present specification are hereby incorporated by 
reference in their entirety. 

EXAMPLES 

Commercially available reagents referred to in the examples were used according to 
manufacturer's instructions unless otherwise indicated. The source of those cells identified in the following 
examples, and throughout the specification, by ATCC accession numbers is the American Type Culture 
Collection, Rockville, Maryland. 

EXAMPLE 1 
Isolation of human fused cDNA clones 
An expressed sequence tag (EST) DNA database (LIFESEQ™. Incyte Pharmaceuticals. Palo Alto. 
CA) was searched for a human homologue of the Drosophila segment poiarity gene fused (SEQ ID NO 26) 
(Preat et al.. Nature 347: 87-9 (1990)). The EST Incyte #2515662 (Fig. 2) (SEQ ID NO. 3) was identified as 
a potential candidate. In order to identify human cDNA libraries containing human fused clones, human 
cDNA libraries in pRK5 were first screened by PCR using the following primers: 
h-FUSED.f (SEQ ID NO. 8) 5"-CAATACAATGGTGCTGACATCCATCAAAGGCA-3' 
h-FUSED.r (SEQ ID NO. 9) 5'-'GAAGGGAGGGGTGCCTACTGCCA-3' 
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A fetal lung library was selected and enriched for fused cDNA clones by extension of single stranded DNA 
from plasm* libraries grown in dug /bung host using the h-FUSED.f primer (SEQ ID NO. 8) in a reaction 
containing lOul of I Ox PCR Buffer (Perkin Elmer), lul dNTP (20 mM). 1 ul library DNA (200 ng), 0.5 ml 
primer. 86.5 ul H : 0 and 1 ul of Amplitaq® (Perkin Elmer) added after a hot start. The reaction was 
5 denatured for 1 min. at 95°C. annealed for 1 min. at 60°C then extended for 20 mm. at 72°C. DNA was 
extracted with phenol/CHCl 3 . ethanol precipitated, then transformed by electroporation into DH10B host 
bacteria. Colon.es from each transformation were plated and lifted on nylon membranes and screened with 
an oligo probe derived from the EST sequence of the following sequence: 

h-FUSED.p (SEQ ID NO. 10) 5'-CTCCAGCTCTGGAGACATATAGAGTGGTGTGCCTTTGA-3' 
10 The oligo probe was labeled with [y- 32 P]-ATP and T4 polynucleotide kinase. Filters were hybridized 
overnight at 42°C in 50% formamide, 5xSSC, lOxDenhardt's, 0.05M sodium phosphate (pH 6.5), 0.1% 
sodium pyrophosphate, 50 ug/ml of sonicated salmon sperm DNA. The filters were then rinsed in 2x SSC 
and washed in 0.1 x SSC. 0.1% SDS then exposed to Kodak® X Ray films. Two positive clones (DNA28494 
(SEQ ID NO. 6) and DNA28495 (SEQ ID NO. 4) - Figs. 4 & 5) containing an insert of approximately 5 kb 
15 were isolated and sequenced. The sequence of clone DNA28495 (SEQ ID NO. 4) contains a potential 
initiation methionine at position 1 16 followed by an open reading frame of 1944 bp (Fig. 4). However, this 
open reading frame (ORF) encodes a protein that is only 648 amino acids long, somewhat shorter than the 
795 amino acid sequence of the Drosophila fused. Interestingly, a second open reading frame is present in 
the 3' reg.on of the cDNA, from nucleotide 2295 to 4349 (Fig. 4), which suggests that the cDNA may have 
20 been improperly spliced and that an intron remains between the 2 ORFs. or correspond to an alternatively 
spliced variant of fused. The sequence of clone DNA28494 (SEQ ID NO. 6) is very similar. There is one 
nucleotide difference between clone DNA28495 (SEQ ID NO. 4) and clone DNA28494 (SEQ ID NO. 6) 
located in the first ORF at position 1863 of clone 28495 (SEQ ID NO. 4) (A vs. G) which changes the 
coding sequence from an Gin to a Arg at position 583. (Fig. 4). This change is likely due to an allelic 
25 variation. The first open reading frame of DNA28494 (SEQ ID NO. 6) starts at residue 1 1 5 and is followed 
by a 647 amino acid long open reading frame. The sequences are identical except for the one change 
described above at position 583 and for the last 9 residues in the first open reading frame. 

EXAMPLE 2 
Expression of fused clones 

30 m order to determine the size of the protein expressed from the cDNA corresponding to 

DNA28495 (SEQ ID NO. 4) and DNA28494 (SEQ ID NO. 6). an HA epitope tag was inserted at the N- 
terminus of the protein by PCR using the following primers: 
Hfus.Cla-HA.F: (SEQ ID NO. 1 1) 

5*-CCATCGATGTACCCATACGACGTCCCAGACTACGCTGAAAAGTACCACGTGTTGGAGATG-3' 

35 and hFus.Xba.R: (SEQ ID NO. 12) 

5'-GCTCTAGACTAAGGGGCAGGTCCTGTGTTCTG-3\ 

The PCR product was purified, digested with Clal-Smal and subcloned into the pRK5 plasmids containing 
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DNA28494 (SEQ ID NO. 6) and DNA28495 (SEQ ID NO. 4). DNA from each of the constructs was 
transfected overnight into 293 cells using the CaP04 method (Sambrook el al. supra: Ausuble el ai. supra). 
After about 24 h. to 48 h. after transfection. the cells were harvested and the cell pellet was lysed in I ml of 
lysine buffer (50 mM Tris pH 8.0. 150 mM NaCl. 1 mM EDTA. 1% NP40, Aprotinin. Leupeptin. -PMSF, I 
5 mM NaF and 1 mM Sodium Vanadate) for 20 min at 4°C. The extract was spun for 10 min at 10K then the 
supernatant was transferred to a new rube and precleared with 20 ul Protein A sepharose for 1 h. The protein 
A sepharose was spun down and I ul of anti-HA antibody (5 ug. Boehringer) was added to each tube. After 
overnight incubation at 4°C. 30 u.1 of Protein G sepharose was added and the tubes incubated at 4°C for I 
hour. The protein G beads were then sun down for 1 min.. washed 3 times with lysis buffer, resuspended in 
10 20 ul of laemli buffer in the presence of P-mercapto ethanol. Samples were denatured for 5 min. at 100°C 
then loaded on a 6% poiyacrylamide gel. Proteins were then transferred to nitrocellulose and analyzed by 
Western blot using the same anti-HA antibody overnight at 1 ug/ml in blocking buffer (PBS, 0.5% Tween®. 
5% non fat dry milk, 3% goat serum followed by an anti-mouse HRP. ECL was used for the detection and 
the membrane was exposed for 90 seconds to X-Ray films. A specific band of 150 kDa was detected in the 
15 cell pellet of cells transfected with the construct with construct corresponding to clone DNA28494 (SEQ ID 
NO. 6) and a specific band of approximately 100 kDa could be detected for clone DNA28495 (SEQ ID NO. 
4 ) (F'g- 6). These bands were not present in the mock transfected control. The presence of the 1 50 kDa 
band suggests the two open reading frames of DNA28494 (SEQ ID NO. 6) can be spliced together to direct 
the synthesis of a large protein of 150 kDa. The absence of this band for DNA28495 (SEQ ID NO. 4) 
20 suggested that this clone apparently cannot be correctly spliced. Alternative splicing of the fused gene seems 
to lead to the production of several different products and may be a mechanism or regulation of fused 
activity. Specific regions at the C-terminus of the Drosophila fused protein is known to be required for the 
activity of the molecule, Therond el cl. Genetics \A2: 1 181-1 198 (1996); Robbins el a!.. Cell 90: 225-234 
(1997). Shorter fused molecules truncated at the C-terminus may therefore correspond to inactive or to 
25 dominant negative forms of the molecule. 

EXAMPLE 3 
Northern Blots 

In order to determine the best tissue source to isolate more fused cDNAs and to identify a transcript 
encoding a full length 150 kDa fused molecule, human multiple tissue northern blots I, II and fetal blot from 

30 Clontech were probed with a 1.6 kb. Clal-AccI fragment derived from clone DNA28494 (SEQ ID NO. 6) 
labeled by random priming. The blots were hybridized in 50% formamide, 5x SSC. IOx Denhardt's, 0.05M 
Sodium phosphate (pH 6.5), 0.1% Sodium pyrophosphate. 50 mg/ml sonicated salmon sperm DNA, all in the 
presence of 1x10" cpm/ml :2 P-labeled probe at 42°C overnight. The blots were washed in 2x SSC at RT for 
10 minutes and washed in 0.2x SSC/0. 1% SDS at 42°C for 30 minutes then exposed to x-ray film overnight. 

35 Fig. 7 shows that the fused message is expressed at high levels in testis and at low levels in most other 
tissues, including fetal tissues. (Fig. 7). 
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EXAMPLE 4 

PCR on different tissues to identify the correct splice form 
In order to isolate a cDNA where the 2 potential ORFs were spliced together correctly, we designed 
the following primers flanking the potential intron and amplified various tissues including human fetal brain. 
5 brain, keratinocyte. testis, ovary, fetal liver, and lung templates. 
Fl (SEQIDNO. 13)5'-CTGACGACACAGCAGGTTGTC-3' 
R4 (SEQ ID NO. 14) 5'-CAGATGCTTCAGGATGGACAT-3' 

Two microliters of each cDNA library was used as the template and PCR was done with Klentaq 
polymerase. PCR was performed for 45 cycles of amplification with 94°C denaturation for 1 min.. 55°C 
10 annealing for I min.. and 68°C extensions for 2 mm. One fifth of the reaction was loaded on 1% agarose gel 
and was Southern blotted. The blot was hybridized overnight with full-length fused probe labeled by random 
priming as described for the Northern blot. 

A I kb PCR fragment was identified in fetal brain, testis and ovary. This fragment was gel-purified 
and subjected to direct PCR sequencing using both the Fl and R4 primers (SEQ ID NOS. 13 and 14) 
1 5 identified above as well a the following primers: 

hfl6 (SEQ ID NO. 15) 5'-AGAGTAGCAACGTCACTGC-3' 
hf8 (SEQ ID NO. 16) 5'-CCTCACTGACAAGGCAGCAGG-3' 
hfl9 (SEQ ID NO. 17) 5'-CCCGAGGAGGCATCTGCACAG-3' 

The sequence of this 1 kb fragment revealed that intron sequences were absent and that the 2 ORFs 
20 were connected together in the same reading frame. The sequence of the correctly spliced sequence is shown 
in Fig. 1 (SEQ ID NO. 1). The initiator ATG is present at position 161 and is followed by an ORF of 3945 
nucleotides which encodes a 1315 amino acid long protein with a predicted molecular weight of 144 kDa. 

The overall similarity with Drosophila fused (SEQ ID NO. 23) is 28% (Fig. 2). The N-terminal 263 
amino acid domain of the protein containing the kinase domain is 55% homologous to the Drosophila fused 
25 kinase domain. The remaining 1052 amino acids portion of the protein is not appreciably homologous to 
other known proteins and. interestingly, is not homologous to the corresponding region in Drosophila fused. 
Interestingly, this region of non-homology includes the very C-terminus of the fly protein which appears to 
be required for activity, Robbins et ai. Cell 90: 225-34 (1997); Therond el aL Genetics 142: 1181-98 
(1996). The improperly spliced cDNAs described above may reflect alternative splicing of the fused gene 
30 which leads to the production of a molecule with a truncated C-terminus and may be a mechanism to regulate 
fused activity. 

EXAMPLE 5 

Reconstitution of the correctly spliced full leng th human fused 
The fused clone DNA28495 (SEQ ID NO. 4) was subcloned from the pRK5B plasmid into 
35 pRKS.tkneo using Clal-Hindlll. PCR was performed using human testis cDNA as a template and the 
primers hO (SEQ ID NO. 18) (CAGAACTTCAGGTCCTAAAGG) and R4 (sequence see above. Example 
4). PCR conditions were 45 cycles of (94°C.I min, 46°C to 68°C temperature gradient annealing for 1 min, 
and 68°C. 4 min). The PCR fragment was digested with Accl and ligated in the pRKS.tkneo.fused plasmid 
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cut with AccI in order to replace the region containing the intron with the correct spliced form. Two 
subclones were sequenced between the two Accl site and had the same correct sequence. 

EXAMPLE 6 
In situ hybridization 

5 El 1.3 and E13.5 mouse embryos were immersion-fixed overnight at 4°C in 4% paraformaldehyde, 

cryoprotected overnight in 15% sucrose, embedded in O.T.C. and frozen on iiquid nitrogen. Adult mouse 
brains were fresh frozen with powdered dry ice. PI mouse brains, adult mouse testis and adult rat spinal 
cords were embedded in O.T.C. and frozen on liquid nitrogen. Sections were cut at 16 mm. and processed 
for in situ hybridization for fused by the method of Phillips et al... Science 250: 290-294 (1990). RNA 

10 probes were labeled with "P-UTP as described by Melton et al., Nucleic Acids Res. J2: 7035-7052 (1984). 
Sense and antisense probes were synthesized from a mouse fused DNA fragment using T3 and T7, 
respectively, corresponding to the region encoding amino acid residues 3 1 7-486 of the human sequence. 

Figure 8 reveals that the mouse fused mRNA is widely distributed in SHh responsive tissues, 
including the neural tube, pre-somitic mesoderm, somites, developing limb buds and skin. Transcripts for 

1 5 fused were also found in the embryonic gut, testis, cartilage and muscle - Tissues that are exposed to the 
other members of the Hh protein family; Desert and Indian. In the El 1-5 mouse nervous system, high levels 
of fused transcripts were detected throughout the forebrain, midbrain, hindbrain and spinal cord. These high 
levels of expression were retained in embryonic day 13.5. In both embryonic days 11.5 and 13.5, fuse' 4 
mRNA was detected mainly in the ventral aspect of the neural tube, in regions that are likely to be exposed to 

20 the ventral midline-derived SHh. By post natal day -I, widespread expression of fused is still maintained 
throughout the brain with high levels of transcripts detected in the cortex, hypocampus. ependima and 
choroid plexus. In the adult, low levels of fused expression are detected all through the brain with higher 
levels confined to the ependima. 

The tissue distribution of fused and the Hh receptor components, Smo and Ptch show considerable 

25 overlap. All of them are initially expressed through the neural tube as well as in other Hh responsive tissues. 
However, whereas Smo mRNA was evenly distributed along the dorso-ventral axis. Ptch and fused mRNAs 
are found at higher levels ventrally, suggesting that they may be upregulated by Hh. In addition while by 
day E12, expression of both Smo and Ptch is found mainly in cells which are in close proximity to the 
ventricular zone, fused mRNA is still widely expressed and its levels decline only later. In the adult 

30 expression of both Smo and fused is confined to the ependima where neurogenesis continues. 

Detailed analysis of fused expression in adult testis was also performed by in situ hybridization (Fig. 
9). fused was found to be expressed at very high levels on stages I and 11 germ cells in the seminiferous 
tubules. Levels of fused vary in different seminiferous tubules, suggesting that its expression is regulated 
according to the germinai cell state of differentiation. 

35 EXAMPLE 7 

Gli Luciferase Assay 

Given the low homology between dfused and bfused. it was prudent to determine whether in fact the 
isolated hfused is indeed a mediator of Hh signaling. The following assay was developed to measure the 
activation of the transcription factor GLI. the mammalian homoiogue of the Drosophila cubitus imerruptus 
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(Ci). It has been shown that GLI is a transcription factor activated upon SHh stimulation of ceils. 

Nine (9) copies of a GLI binding site present in the HNF3P enhancer, (Sasaki ei al. Development 
124 : 1313-1322 (1997)), were introduced in front of a thymidine kinase minimal promoter driving the 
luciferase reporter gene in the pGL3 piasmid (Promega). The sequence of the GLI binding sequence was: 
5 TCGACAAGCAGG GAACACCCA AGTAGAAGCTC (p9XGiiLuc) (SEQ ID NO. 19). while the negative 
control sequence was: TCGACAAGCAGG GAAGTGGGA AGTAGAAGCTC (p9XmGliLuc) (SEQ ID NO. 
20). These constructs were cotransfected with the full length fused construct or with a piasmid encoding 
sonic hedgehog in C3H10TI/2 cells grown in F12, DMEM (50:50), 10% FCS heat inactivated. The day 
before transfection 1 x 10 3 cells per well was inoculated in 6 well plates, in 2 ml of media. The following 

10 day, 1 ug of each construct was cotransfected in duplicate with 0.025 mg ptkRenilla luciferase piasmid using 
lipofectamine (Gibco-BRL) in 100 ul OptiMem (with GlutaMAX) as per manufacturer's instructions for 3 
hours at 37°C. Serum (20%, I ml) was then added to each well and the cells were incubated for 3 more 
hours at 37°C. Cells were then washed twice with PBS, then incubated for 48 hours at 37°C in 2 ml of 
media. Each well was then washed with PBS, and the cells lysed in 0.5 ml Passive Lysis Buffer (Promega) 

15 for 15 min. at room temperature on a shaker. The lysate was transferred in eppendorf tubes on ice, spun in a 
refrigerated centrifuge for 30 seconds and the supernatant saved on ice. For each measure, 20 ul of cell 
lysate was added to 100 ul of LARII (luciferase assay reagent. Promega) in a polypropylene tube and the 
luciferase light activity measured. The reaction was stopped by the addition of Stop and Giow buffer 
(Promega). mixed by pipetting up and down 3 to 5 times and Renilla luciferase lights activity was measured 

20 on the luminometer. 

As shown in Figure 6, fused can induce GLI activity (9.5 fold) in a similar manner as SHh (5.5 fold) 
This result suggests that the fused gene isolated is a mediator of SHh signaling. An irrelevant serine- 
threonine kinacs, Akt, was not active in this assay (data not shown). The fused activity is dependent on an 
intact kinase domain as molecules with deletion of this region (fused C-term) (SEQ ID NO. 27) or mutation 

25 of a conserved lysine residue at about amino acid position 33 in the ATP binding site (fused-DN (SEQ ID 
NO. 25)) were not able to activate GLI. Similarly, the C-terminal tail of the protein is necessary for this 
activity since the kinase domain alone was not active in this assay (fused KD) (SEQ ID NO. 24). Expression 
of each protein was verified by Western blot using an HA tag inserted at the N-terminus of the molecule 
(data not shown). These results substantiate the conclusion that the homoiogue of the dfused isolated by 

30 Applicants is indeed hfused. Furthermore, these results indicate that fused is capable of and sufficient for the 
activation of Gli. the major target of SHh signaling and is thus likely to be a direct mediator of the SHh signal 
in vertebrates. 

EXAMPLE 8 
Induced cyclopia in frog embryos 

35 Introduction: 

In order to demonstrate that the human fused gene is not only capable of but also required to transduce the 
SHh signal in vertebrates, a mutant version of fused known as fused-DN (dominant negative) having a 
mutation of the lysine at position 33 in the ATP binding site was created (SEQ ID NO. 25). This residue is 
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conserved among all kinases and is necessary for kinase activity (Hanks et ai. Methods Enzymol. 200: 38-62 
(1991) and its conversion to any other residue in most cases results in the creation of dominant negative 
mutants. 
Methods: 
5 Plasm id Construction : 

Wild type fused cDNA (SEQ ID NO. 1) with an HA tag inserted at the carboxy terminus was 
subcloned into pRK5 and a dominant negative form was generated by conversion of lysine at positive 33 to 
an arginine. Supercoiled plasmid DNA was prepared by Qiagen and used for injection into Xenopus laevis 
embryo. 

10 Manipulation of Xenopus embrvos : 

Adult female frogs were boosted with 200 I.U. pregnant mare serum 3 days before use and with 800 
l.U. of human chorionic gonadotropin the night before injection. Fresh oocytes were squeezed out from 
female frogs the next morning and in vitro fertilization of oocytes was performed by mixing oocytes with 
minced testis from sacrificed male frogs. Developing embryos were maintained and staged according to 

1 5 Nieuwkoop and Faber, Normal Table of Xenopus laevis. N.-H. P. Co., ed. (Amsterdam. 1 967). 

Fertilized eggs were dejeilied with 2% cysteine (pH 7.8) for 10 minutes, washed once with distilled 
water and transferred to 0.1 x MBS with 5% Ficoll. Fertilized eggs were lined on injection trays in 0.1 x 
MBS with 5% Ficoll. Two-cell stage developing Xenopus embryos were injected with 200 pg of either 
pRK5 containing wild type fused (WT (SEQ ID NO. 1)) or dominant negative fused (D'r: (SEQ ID NO. 25)). 

20 Injected embryos were kept on trays for another 6 hours, after which they were transferred to 0.1 x MBS 
with 50 mg/ml gentamycin for 3 days until reaching Nieukwkoop stage 35 when eye development is 
complete. 
Results : 

To test whether human fused gene acts as a signal transducer of Hedgehog signaling, we injected 
25 wild type or dominant negative form of human fused in developing frog embryos. Embryos injected with 
120 pg of DNA divided normally in blastula stage and gastrulate normally. While eye development was 
normal in wild type, fused (SEQ ID NO. 2) injected and mock injected embryos, about 30% (Table I) of the 
embryos that were injected with fused-DN showed fused eye structure or two eyes connected by some 
pigmented retina tissue (Fig. 1 1 A). In Table I, 200 pg of plasmid DNA was delivered to the animal pole of 
30 2-celI stage embryos. Each sample represents the results of at least 3 independent experiments. Embryos 
were scored visually for cyclopia defects. 

TABLE 1 



Fusion-DN Induced Cyclopia in Xenopus Embryos 



Injected DNA 


Normal 


Cyclop 


n 


Hu-fused(SEQ ID NO. 2) 


45 


0 


45 


kinase domain (SEQ ID NO. 24) 


43 


0 


43 


C-terminus (SEQ ID NO. 27) 


53 


1 


54 


fused DN (SEQ ID NO. 25) 


32 


15 


47 


uninjected 


61 


0 


61 
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The observed cyclopia phenotype is strikingly similar to the one of mouse embryos deficient in SHh 
(Chiang et ai, Nature 383: 407-13 (1996) and of zebrafish embryos where SHh signaling has been blocked 
by overexpression of a constitutive active PICA. Hammerschmidt et ai. Genes Dev. JO: 647-58 (1996); 
5 Ungar and Moon. Dew Biol. J_7S: 186-91 (1996). In addition, both brain (forebrain) and gut development 
appeared normal at later stages of tadpole development in the fused-UH (SEQ ID NO. 25) injected embryos 
(Fig. 1 IB). In contrast, embryos overexpressing either wild type fused (SEQ ID NO. 2) or N or C-terminal 
terminal truncation mutants (SEQ ID NOS. 27 & 24. respectively) did not present any abnormalities. 

During normal development of the Xenopus eye, the eye primordium starts as a single field 
10 expressing transcription factor Pax-6, which is a vertebrate homologue of Drosophila eyeless, Li et ai, 
Development, J24: 603-15 (1997). At the neurula stage, this eye field is separated into two eye primordia 
due to an inhibiting signal from prechordal mesoderm. It has been further demonstrated that SHh is the 
prechordal mesoderm derived signal that is responsible for the inhibition of Pax-6 expression in the midline 
of the eyefield. 

15 To further understand how overexpression of/kse^-DN (SEQ ID NO: 25) induced a fused eye in 

Xenopus embryos, whole mount in situ hybridization was performed in order to determine the expression 
pattern of Pax-6 in injected embryos. As shown in Figure 1 1C. Pax-6 expression in embryos injected with 
fused-DN (SEQ ID NO: 25) remains as a single field (Fig. I ! D). Thus fused- DM (SEQ ID NO: 25) induces a 
cyclopia phenotype by most likely preventing SHh from inhibiting Pax-6 expression in the midline of the 

20 eyefield. 

EXAMPLE 9 

Rescue of fused-DN (SEQ ID NO. 25) Injected Xenopus Embryos bv Gli 
SHh expression in early floor plate cells is induced by SHh produced by the notochord. To test 
whether SHh expression in the floor plate will also be inhibited when SHh signaling is blocked, early neurula 

25 stage embryos injected with fused-DN or wild-type constructs were stained for SHh expression (See Example 
8 for procedure). SHh expression in floor plate cells or early neurula stage embryos was completely 
suppressed in 26 out of 28 embryos injected when the mutated fused is overexpressed (Table 2, Figure 1 1C. 
left embryo), while the expression of SHh was unaffected in control embryos (Fig. 6E, right embryo). Table 
2 represents scored data from three independent experiments. 100 pg of fused-DN. 1 00 pg of fused-wt or 50 

30 pg of Gli- 1 plasmid were injected in 2-cell stage embryos. Embryos were harvested at early neurula stage for 
SHh staining. 

TABLE 2 

Wild type fused and Gli rescue SHh expression in floor piate when coexpressed with fused-DN 





SHh staining 


percentage 


fused-UH 


2/28 


7% 


fused-UH + fused WT 


20/24 


83% 


fused-DN + Gli 


36/36 


100 
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To confirm thai this phenotype was due to specific inhibition of the SHh signaling pathway in the 
floor plate, we attempted to rescue the phenotype by coinjection of wt fused RNA with fitsed-OU RNA in a 
1:1 ratio. Table 2 shows that more than 80% of the embryos coinjected with wt fused mi fused-UH RNAs 
show norma! SHh staining in the floor plate. This demonstrates that SHh expression in fused-DH injected 
5 embryos is specifically blocked by inhibition of endogenous fused activity. 

To further demonstrate that the observed phenotype of fused-DN are due to disruption of the SHh 
signal cascade and to confirm that bfused works upstream of Gli in this pathway, we asked whether the 
overexpression of Gli can aiso rescue the phenotype of Xenopus embryos injected with fused-DN. As shown 
in Table 2, the rescue of SHh expression in the floor plate of fused-DN injected embryos is complete when 
10 Gli is overexpressed. Taken together, these findings are consistent with Applicants hypothesis that vertebrate 
fused functions in the SHh pathway and that is a necessary mediator in the SHh signal transduction pathway, 
which acts upstream of Gli. 

EXAMPLE 10 
Immunoprecipitations and In Vitro Kinase Assay 
1 5 To directly determine whether hfused has kinase activity, fused (SEQ ID NO. 2), fused-DN (SEQ ID 

NO. 25) and fused-kd (SEQ ID NO. 24) cDNAs were tagged with the influenza HA epitope tag and 
transiently transfected into 293 cells, lmmunoprecipitates were tested for kinase activity in the presence of 

myelin basic protein (MBP) and [y- j2 P]-ATP. The amount of 32P irtorporated into MBP was determined 

after SDS-PAGE and found to be was about 3 times higher than in fused-KD (SEQ ID NO. 24) and 2 times 
20 higher in wt fused (SEQ ID NO. 2) containing extracts compared to controls, while mutation of Lys33 to Arg 
(fused-DN (SEQ ID NO 25)) neutralizes the activity (Fig. 12). 

For immunoprecipitation experiments human embryonic kidney 293 cells were transiently 
transfected with the various expression plasmids. After 24 hours, the tranrfected cells were collected and 
lysed for 20 min. at 4°C in 1 ml of lysis buffer (50 mM Tris. pH 8.0), 150 mM NaCI. 1 mM EDTA, I mM 
25 sodium fluoride. I mM sodium orthovanadate, 1 mM PMSF and protease inhibitors (Complete, Boehringer 
Mannheim) containing 1% NP-40, 0.5% deoxycholic acid. Cell debris was removed by centrifugation for 10 
min. at 10,000 rpm and the sodium chloride concentration of the cell lysates was increased to 250 mM. The 
supernatant was precleared for I hour with 20 ul Protein A Sepharose (Pharmacia). Lysates were 
immunoprecipitated using anti-HA antibodies followed by Protein A Sepharose. The beads were washed 
30 twice with lysis buffer containing 250 mM sodium chloride, twice with lysis buffer containing 1 M sodium 
chloride, and then twice with kinase assay buffer (20 mM HEPES, pH 7.6), 1 mM DTT. 1 mM NaF and 1 
mM sodium orthovanadate). After the last wash, the beads were resuspended in 20 ul kinase assay buffer 

supplemented with 10 mCi [y- j2 P]-ATP. 20 mM (^-glycerophosphate. 20 mM PNPP, 20 mM MgCI 2 . 1 mM 

EGTA, 100 uM cold ATP and 0.5 mg/ml Myelin Basic Protein (Sigma), and incubated for 20 min. at 37°C. 
35 Reactions were stopped with 20 ul SDS-sample buffer, run on a denaturing 4-20% SDS polyacrylamide gel, 
and analyzed by phosphoimager. 
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EXAMPLE 1 1 
Expression of fused in E. coli 

The DNA sequence encoding human fused is initially amplified using selected PCR primers. The 
primers should contain restriction enzyme sites that correspond to the restriction enzyme sites on the selected 
expression vector. A variety of expression vectors may be employed. An example of a suitable vector is 
pBR322 (derived from £. coli; see Bolivar et al. Gene, 2:95 (1977)) which contains genes for ampicillin and 
tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated. The PCR 
amplified sequences are then ligated into the vector. The vector will preferably include sequences that 
encode for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six ST1I codons, 
polyhis sequence, and enterokinase cleavage site), the vertebrate fused coding region, lambda transcriptional 
terminator, and an argU gene. 

The ligation mixture is then used to transform a selected E. coli strain using the methods described 
in Sambrook et al., supra . Transformants are identified by their ability to grow on LB plates and antibiotic 
resistant colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and 
DNA sequencing. 

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented 
with antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The 
cells are then grown to a desired optical density, during which the expression promoter is turned on. " 

After culturing the cells for several more hours, the celis can be harvested by centrifugation. The 
cell pellet obtained by the centrifugation can be solubilized using various agents known in the art and the 
solubilized vertebrate fused protein can then be purified using a metal chelating column under conditions that 
allow tight binding of the protein. 

EXAMPLE 12 
Expression of fused in mammalian cells 

The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression 
vector. Optionally, the vertebrate fused DNA is ligated into pRK5 with selected restriction enzymes to allow 
insertion of the vertebrate fused DNA using ligation methods such as described in Sambrook et al., supra . 
The resulting vector is called pRK5-fused. 

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) 
are grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf 
serum and optionally, nutrient components and/or antibiotics. About 10 ug pRK5-fused DN A is mixed with 
about 1 ug DNA encoding the VA RNA gene [Thimmappaya et al., Cell, 3J.:543 (1982)] and dissolved in 

500 ul of 1 mM Tris-HCl. 0. 1 mM EDTA, 0.227 M CaCI 2 . To this mixture is added, dropwise, 500 ul of 50 
mM HEPES (pH 7.35), 280 mM NaCl, 1 .5 mM NaP0 4 , and a precipitate is allowed to form for 10 minutes at 
25°C. The precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 
37°C. The culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 
293 cells are then washed with serum free medium, fresh medium is added and the cells are incubated for 
about 5 days. 
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ttt acc cat teg cat gtc gtc tct ctt gtg agt gca gca 2569 
Phe Thr His Ser His Val Val Ser Leu Vai Ser Ala Ala 
795 800 

gec tgt eta ttg gga cag ctt ggt cag caa ggg gtg acc 2608 
5 Ala Cys Leu Leu Gly Gin Leu Gly Gin Gin Gly Val Thr 
805 810 815 

ttt gac etc cag ccc atg gaa tgg atg get gca gec aca 2647 
Phe Asp Leu Gin Pro Met Glu Trp Met Ala Ala Ala Thr 
820 825 

10 cat gec ttg tct gec cct gca gag gtt egg ttg act cca 2686 
His Ala Leu Ser Ala Pro Ala Glu Val Arg Leu Thr Pro 
830 835 840 

cca ggt agt tgt gga ttc tat gat ggc etc ctt ate ctt 2725 
Pro Gly Ser Cys Gly Phe Tyr Asp Gly Leu Leu lie Leu 
15 845 850 855 

ctg ttg cag etc etc act gag cag ggg aag get age eta 2764 
Leu Leu Gin Leu Leu Thr Glu Gin Gly Lys Ala Ser Leu 
860 865 

ate agg gat atg tec agt tea gaa atg tgg acc gtt ttg 2803 
20 lie Arg Asp Met Ser Ser Ser Glu Met Trp Thr Val Leu 
870 875 880 

tgg cac cgc ttc tec atg gtc ctg agg etc ccc gag gag 2842 
Trp' His Arg Phe Ser Met Val Leu Arg Leu Pro Glu Glu 
885 890 

25 gca tct gca cag gaa ggg gag ctt teg eta tec agt cca 2881 
Ala Ser Ala Gin Glu Gly Glu Leu Ser Leu Ser Ser Pro 
895 900 905 

cca age cct gag cca gac tgg aca ctg att tct ccc cag 2920 
Pro Ser Pro Glu Pro Asp Trp Thr Leu lie Ser Pro Gin 
30 910 915 920 

ggc atg gca gec ctg ctg age ctg gec atg gec acc ttt 2959 
Gly Met Ala Ala Leu Leu Ser Leu Ala Met Ala Thr Phe 
925 930 

acc cag gag ccc cag tta tgc ctg age tgc ctg tec cag 2998 
35 Thr Gin Glu Pro Gin Leu Cys Leu Ser Cys Leu Ser Gin 
935 940 945 

cat gga agt ate etc atg tec ate ctg aag cat ctg ctt 3037 
His Gly Ser lie Leu Met Ser lie Leu Lys His Leu Leu 
950 955 

40 tgc ccc age ttc ctg aat caa ctg cgc cag gcg cct cat 3076 
Cys Pro Ser Phe Leu Asn Gin Leu Arg Gin Ala Pro His 
960 965 970 

ggg tct gag ttt etc cct gtc gtg gtg etc tct gtc tgc 3115 
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600 605 

tac tec age ttg ctg acg aca cag cag gtt gtc ttg gat 2023 
Tyr Ser Ser Leu Leu Thr Thr Gin Gin Val Val Leu Asp 
6X0 615 620 

ggg etc ctt cat ggc ttg aca gtt cca cag etc cct gtc 2062 
Gly Leu Leu His Gly Leu Thr Val Pro Gin Leu Pro Val 
625 630 

cac act ccc caa gga gee ccg caa gtg age cag cca ctg 2101 
His Thr Pro Gin Gly Ala Pro Gin Val Ser Gin Pro Leu 
635 640 645 

cga gag cag agt gag gat ata cct gga gec att tec tct 2140 
Arg Glu Gin Ser Glu Asp lie Pro Gly Ala He Ser Ser 
650 ' 655 660 

gee ctg gca gee ata tgc act get cct gtg gga ctg ccc 2179 
Ala Leu Ala Ala He Cys Thr Ala Pro Val Gly Leu Pro 
665 670 

gac tgc tgg gat gee aag gag cag gtc tgt tgg cat ttg 2218 
Asp Cys Trp Asp Ala Lys Glu Gin Val Cys Trp His Leu 
675 680 685 

gca aat cag eta act gaa gac age age cag etc agg cca 2257 
Ala Asn Gin Leu Thr Glu Asp Ser Ser Gin Leu Arg Pro 
6S0 695 

tec etc ate tct ggc ctg cag cat ccc ate ctg tgc ctg 2296 
Ser Leu He Ser Gly Leu Gin His Pro He Leu Cys Leu 
700 705 710 

cac ctt etc aag gtt eta tac tec tgc tgc ctt gtc agt 2335 
His Leu Leu Lys Val Leu Tyr Ser Cys Cys Leu Val Ser 
715 ~ 720 725 

gag ggc ctg tgc cgt ctt ctg ggg cag gag ccc ctg gee 2374 
Glu Gly Leu Cys Arg Leu Leu Gly Gin Glu Pro Leu Ala 
730 735 

ttg gaa tec ctg ttt atg ttg att cag ggc aag gta aaa 2413 
Leu Glu Ser Leu Phe Met Leu He Gin Gly Lys Val Lys 
740 745 750 

gta gta gat tgg gaa gag tct act gaa gtg aca etc tac 24 52 
Val Val Asp Trp Glu Glu Ser Thr Glu Val Thr Leu Tyr 
755 760 

ttc etc tec ctt ctt gtc ttt egg etc caa aac ctg cct 24 91 
Phe Leu Ser Leu Leu Val Phe Arg Leu Gin Asn Leu Pro 
765 770 775 

tgt gga atg gag aag eta ggc agt gac gtt get act etc 2530 
Cys Gly Met Glu Lys Leu Gly Ser Asp Val Ala Thr Leu 
780 785 790 
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Pro Asp 
415 

act gag 
Thr Glu 

5 

ttg ctg 
Leu Leu 
440 

cag ctg 
10 Gin Leu 



ttg gag 
Leu Glu 



15 ctg age 
Leu Ser 
480 

ttg tat 
Leu Tyr 

20 

ctg ctg 
Leu Leu 
505 

etc cag 
2 5 Leu Gin 



ctg atg 
Leu Met 



30 aat ctg 
Asn Leu 
545 

ttt cag 
Phe Gin 

35 

aaa ctg 
Lys Leu 
570 

egg agg 
4 0 Arg Arg 



gec atg 
Ala Met 



Ser Asp Asn 



cct gtg cct 
Pro Val Pro 
430 

tgt aat cct 
Cys Asn Pro 



cat gaa get 
His Glu Ala 
455 

ggt get tec 
Gly Ala Ser 
470 

agt ctt etc 
Ser Leu Leu 



tec ttc tgc 
Ser Phe Cys 
495 

agt eta etc 
Ser Leu Leu 



cag caa tct 
Gin Gin Ser 
520 

get gtg att 
Ala Val He 
535 

gag agg age 
Glu Arg Ser 



gag get gec 
Glu Ala Ala 
560 

ctg gee caa 
Leu Ala Gin 



gac age ctt 
Asp Ser Leu 
585 

gat ggg aac 
Asp Gly Asn 



Glu Trp Gin 
420 

att caa ctg 
He Gin Leu 



gac ttc tgc 
Asp Phe Cys 
445 

gga ggg cag 
Gly Gly Gin 
460 

cac ate ctg 
His He Leu 



tec age tgc 
Ser Ser Cys 
485 

egg gag gca 
Arg Glu Ala 



agg cac agt 

Arg His Ser 
510 

tgg tat ggg 

Trp Tyr Gly 
525 

cag gee tac 

Gin Ala Tyr 



cag aca agt 
Gin Thr Ser 
550 

aac ctt ttt 
Asn Leu Phe 



cca gat gac 
Pro Asp Asp 
575 

atg tgc ttt 
Met Cys Phe 
590 

age egg gec 
Ser Arg Ala 



His Leu Leu 



aag get cct 
Lys Ala Pro 

435 

cag cgc ate 
Gin Arg He 
450 

ate ctg aaa 
lie Leu Lys 



cct gca ttc 
Pro Ala Phe 
475 

agt gat tct 
Ser Asp Ser 



ggg ctt cct 
Gly Leu Pro 
500 

cag gag age 
Gin Glu Ser 
515 

acc ttc tta 
Thr Phe Leu 



ttt gee tgt 
Phe Ala Cys 
54 0 

gac age ctg 
Asp Ser Leu 



ctg gac ctg 
Leu Asp Leu 
565 

tct gag cag 
Ser Glu Gin 
580 

act gtc ctg 
Thr Val Leu 



ate tec aaa 
He Ser Lys 



Glu Thr 
425 

etc acc 1477 
Leu Thr 



cag agt 1516 
Gin Ser 



ggc ate 1555 
Gly He 
465 

egg gtc 1594 
Arg Val 



gtt gee 1633 
Val Ala 
490 

ggg ctg 1672 
Gly Leu 



aac age 1711 
Asn Ser 



cag gac 1750 
Gin Asp 
530 

acc ttc 1789 
Thr Phe 



cag gtg 1828 
Gin Val 
555 

ttg ggg 1867 
Leu Gly 



act ttg 1906 
Thr Leu 



tgc gaa 194 5 
Cys Glu 
595 

gee ttt 1984 
Ala Phe 
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cag gga 
Gin Gly 



tgg cca 
5 Trp Pro 
245 

gtc acc 
Val Thr 



10 acc cca 
Thr Pro 



eta aag 
Leu Lys 
15 285 

aat cag 
Asn Gin 



get gag 
20 Ala Glu 
310 

cct gec 
Pro Ala 



25 cct ggc 
Pro Gly 



cag gaa 
Gin Glu 
30 350 

ttg aag 
Leu Lys 



ccc tct 
35 Pro Ser 
375 

gaa cga 
Glu Arg 



40 cag egg 
Gin Arg 



cca gac 



ctg etc acc 
Leu Leu Thr 
235 

gac etc tta 
Asp Leu Leu 



ata ata act 
lie lie Thr 
260 

ttc acc age 
Phe Thr Ser 
275 

gac gaa cag 
Asp Glu Gin 



tct cgc ate 
Ser Arg lie 
300 

gag gec atg 
Glu Ala Met 



ctt gag caa 
Leu Glu Gin 
325 

aca gec cct 
Thr Ala Pro 
340 

tea age etc 
Ser Ser Leu 



age age tgg 
Ser Ser Trp 
365 

gca cct egg 
Ala Pro Arg 



gca ttc cca 
Ala Phe Pro 
390 

age act gat 
Ser Thr Asp 
405 

agt gac aat 



aaa gac cca 
Lys Asp Pro 



tat cac ccc 

Tyr His Pro 
250 

gag cca gca 
Glu Pro Ala 
265 

cgc eta ccc 
Arg Leu Pro 



gec cat egg 
Ala His Arg 
290 

ttg act cag 
Leu Thr Gin 



cag aag aaa 
Gin Lys Lys 
315 

gag gac aag 
Glu Asp Lys 
330 

ctg ccc aga 
Leu Pro Arg 



ctg gec ggg 
Leu Ala Gly 
355 

get aaa tea 
Ala Lys Ser 



gaa aac egg 
Glu Asn Arg 
380 

gag gag agg 
Glu Glu Arg 
395 

gta gtg gac 
Val Val Asp 



gag tgg cag 



egg cag cga 
Arg Gin Arg 
240 

ttt att get 
Phe lie Ala 
255 

ggc cca gat 
Gly Pro Asp 



cca gaa ctt 
Pro Glu Leu 
280 

ttg gec ccc 
Leu Ala Pro 



gee tat aaa 
Ala Tyr Lys 
305 

cat cag aac 
His Gin Asn 
320 

acc age aag 
Thr Ser Lys 



etc ggg gee 
Leu Gly Ala 
345 

ate tta gee 
lie Leu Ala 



ggg act gga 
Gly Thr Gly 
370 

acc acc cca 
Thr Thr Pro 
385 

cca gag gtg 
Pro Glu Val 



ctg gaa aat 
Leu Glu Asn 
410 

cac ctg eta 



ctg tec 892 
Leu Ser 



ggt cat 931 
His 



ttg ggg 97 0 
Leu Gly 
270 

cag gtc 1009 
Gin Val 



aag ggt 1048 
Lys Gly 
295 

cgc atg 1087 
Arg Met 



aca gga 1126 
Thr Gly 



gtg get 1165 
Val Ala 
335 

act cct 1204 
Thr Pro 



tea gaa 1243 
Ser Glu 
360 

gag gtg 1282 
Glu Val 



gat tgt 1321 
Asp Cys 



ctg ggc 1360 
Leu Gly 
400 

gag gag 1399 
Glu Glu 



gag acc 1438 
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cga gag 
Arg Glu 
50 

5 att gtg 
He Val 



gtg gtg 
Val Val 

10 

cag ate 
Gin He 
90 

gtt cag 
15 Val Gin 



tat ctg 
Tyr Leu 
115 

20 cct cag 
Pro Gin 



etc tgt 
Leu Cys 

25 

aca atg 
Thr Met 
155 

atg tct 
30 Met Ser 



aca gcg 
Thr Ala 
180 

35 ctg gca 
Leu Ala 



ttt cag 
Phe Gin 

40 

tgg ccc 
Trp Pro 
220 



40 

att gaa ata 
He Glu He 



cat atg ctt 
His Met Leu 
65 

gtg gtg aca 
Val Val Thr 
80 

eta gaa gat 
Leu Glu Asp 



gee att get 
Ala He Ala 
105 

cat tec cac 
His Ser His 



aac ate etc 
Asn He Leu 
130 

gac ttt gga 
Asp Phe Gly 
145 

gtg ctg aca 
Val Leu Thr 



cca gag ctg 
Pro Glu Leu 
170 

gac etc tgg 
Asp Leu Trp 



gta ggc acc 
Val Gly Thr 
195 

ctg gtc age 
Leu Val Ser 
210 

tea acc ate 
Ser Thr He 



atg egg ggt 
Met Arg Gly 
55 

gac age ttt 
Asp Ser Phe 
70 

gac tat get 
Asp Tyr Ala 



gac gga aaa 
Asp Gly Lys 
95 

gee cag ttg 
Ala Gin Leu 



cgc ate eta 
Arg He Leu 
120 

etc gee aag 
Leu Ala Lys 
135 

ttt gee egg 
Phe Ala Arg 



tec ate aaa 
Ser He Lys 
160 

gtg gag gag 
Val Glu Glu 



tct gtt ggc 
Ser Val Gly 
185 

cct ccc ttc 
Pro Pro Phe 
200 

etc att etc 
Leu He Leu 



agt ccc tgc 
Ser Pro Cys 
225 



45 

ctg egg cat 
Leu Arg His 
60 

gaa act gat 
Glu Thr Asp 



gag gga gag 
Glu Gly Glu 
85 

ctt cct gaa 
Leu Pro Glu 



gtg tea gee 
Val Ser Ala 
110 

cac cga gat 
His Arg Asp 
125 

ggt ggt ggc 
Gly Gly Gly 



get atg age 
Ala Met Ser 
150 

ggc aca cca 
Gly Thr Pro 



cga cca tac 
Arg Pro Tyr 
175 

tgc ata eta 
Cys He Leu 
190 

tat get aca 
Tyr Ala Thr 



aag gac cct 
Lys Asp Pro 
215 

ttt aag aac 
Phe Lys Asn 



ccc aac 346 
Pro Asn 



aaa gag 385 
Lys Glu 
75 

etc ttt 424 
Leu Phe 



gac cag 463 
Asp Gin 
100 

ctg tac 5 02 
Leu Tyr 



atg aag 541 
Met Lys 



ate aag 580 
He Lys 
140 

acc aat 619 
Thr Asn 



etc tat 658 
Leu Tyr 
165 

gac cac 697 
Asp His 



tat gaa 736 
Tyr Glu 



age ate 775 
Ser He 
205 

gtg cgc 814 
Val Arg 



ttc ctg 853 
Phe Leu 
230 
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Sequence Listing 

<110> Genentech, Inc. 

<120> Fused 

<130> P1272PCT 

5 <141> 1999-02-25 

<150> US 09/031,563 
<151> 1998-02-26 

<160> 27 

<210> 1 

10 <211> 4880 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unknown N 
15 <222> 4160 

<223> unknown 

<220> 

<221> unknown N 
<222> 4243 
20 <223> unknown 

<220> 

<221> unknown N 
- 222> 4361 
<223> unknown 

25 <400> 1 

cccggggatc ctctagagat ccctcgacct cgacccacgc gtccgcccac 50 

gcgtccgccc acgcgtccgg ggcgtcccag atgttgtgga actgtccctg 100 
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28. A method of screening for antagonist or agonist molecule of fused biological activity comprising: 

(a) exposing a fused substrate and a compound having fused biological activity to a candidate 
antagonist or agonist; and 

5 (b) analyzing the substrate to assess the level and/or identity of phosphorylation; and 

comparing the results to control reactions which were not exposed to the candidate molecule. 

29. A method of diagnosing to determine whether a particular disorder is modulated by hedgehog 
signaling, comprising: 

10 (a) culturing test cells or tissues; 

(b) administering a compound which can inhibit fused modulated hedgehog signaling; and 

(c) measuring the degree of kinase attenuation on the fused substrate in cell lysates or 
hedgehog mediated phenorypic effects in the test cells. 
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15. The host cell of claim 14 which is Saccharomyces cerevisiae. 

16. A process for producing vertebrate fused polypeptides comprising culturing the host cell of claim 9 
under conditions suitable for expression of vertebrate fused and recovering vertebrate fused from the cell 
culture. 

17. Isolated native sequence human fused polypeptide comprising amino acid residues 1 to 1315 of Fig. 
1. 

18. Isolated native sequence human fused polypeptide encoded by the nucleotide deposited under 
accession number ATCC 209637 having/i«eo r biological activity. 

19. A chimeric molecule comprising vertebrate fused polypeptide fused to a heterologous amino acid 
sequence. 

20. The chimeric molecule of claim 19 wherein said heterologous amino acid sequence is an epitope tag 
sequence. 

21. The chimeric molecule of claim 22 wherein said heterologous amino acid sequence is a constant 
region of an immunoglobulin. 

22. An antagonist of vertebrate fused which biocks. prevents, inhibits and/or neutralizes the normal 
functioning of fused in the Hh signaling pathway. 

23. The antagonist of claim 22 which is a small bioorganic moiecule. 

24. The antagonist of claim 22 which is an antisense nucleotide. 

25. An agonist of vertebrate fused with stimulates or enhances the normal functioning of fused in the Hh 
signaling pathway. 

26. The agonist of claim 25 which is a small bioorganic molecule. 

27. A method of screening for antagonists or agonists of fused biological activity comprising: 

(a) exposing the fused expressing target cells in culture to a candidate compound: and 

(b) analyzing cell lysates to assess the level and/or identity of phosphorylation; or 

(c) scoring phenotypic or functional changes in treated cells: 

and comparing the results to control cells which were not exposed to the candidate compound. 
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What is claimed is: 

1. Isolated nucleic acid comprising DNA having at least a 95% sequence identity to (a) a DNA 
molecule encoding a vertebrate polypeptide comprising the sequence of amino acids 1 to about 260 of 
Fig. 1 (SEQ ID NO:24), or (b) the complement of (a): and encoding a polypeptide having fused biological 
activity. 

2. The isolated nucleic acid of claim I comprising DNA having at least a 95% sequence identity to (a) 
a DNA molecule encoding a human fused polypeptide comprising the sequence of amino acids 1 to 1315 
(SEQ ID NO: 2) of Fig. 1, or (b) the complement of the DNA molecule of (a). 



3. The isolated nucleic acid of claim 1 comprising DNA encoding a vertebrate fused polypeptide 
having amino acid residues 1 to 260 of Fig. 1 (SEQ ID NO:24). 

4. The isolated nucleic acid of claim I comprising DNA encoding a vertebrate fused polypeptide 
15 having a lysine at amino acid position 33. 

5. An isolated nucleic acid comprising DNA having at least a 95% sequence identity to (a) a DNA 
molecule encoding the same mature polypeptide encoded by the cDNA in ATCC Deposit No. 20963" 7 . or (b) 
the complement of the DNA molecule of (a). 



6. The isolated nucleic acid of claim 5 comprising human fused encoding sequence of the cDN A in 
ATCC deposit No. 209637, or a sequence which hybridizes thereto under stringent conditions. 

7. A vector comprising the nucleic acid of claim I . 

8. Tne vector of claim 7 operably linked to control sequences recognized by a host cell transformed 
with the vector. 

9. A host cell transformed with the vector of claim 8. 

10. The host cell of claim 9 which is mammalian. 



11. The host cell of claim 1 0 wherein said cell is a CHO cell. 

1 2. The host cell of claim 9 which is prokaryotic. 

13. The host cell of claim 12 wherein said cell is an E. colt. 

1 4. The host cell of claim 9 wherein said cell is a yeast cell. 
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produced in ihe ascites can be accomplished using ammonium sulfate precipitation, followed by gel 
exclusion chromatography. Alternatively, affinity chromatography based upon binding of antibody to 
protein A or protein G can be employed. 

Deposit of Material 

The following materials have been deposited with the American Type Culture Collection, 12301 
Parklawn Drive, Rockville. MD. USA (ATCC): 

Designation: ATCC Dep. No. Deposit Date 

pRK5tkneo.hFused-1272 209637 2/19/98 

This deposit was made under the provisions of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations 
thereunder (Budapest Treaty). This assures maintenance of a viable culture of the deposit for 30 years from 
the date of deposit. The deposit will be made available by ATCC under the terms of the Budapest Treaty, 
and subject to an agreement between Genentech, Inc. and ATCC, which assures permanent and unrestricted 
availability of the progeny of the culture of the deposit to the public upon issuance of the pertinent U.S. 
patent or upon laying open to the public of any U.S. or foreign patent application, whichever comes first, and 
assures availability of the progeny to one determined by the U.S. Commissioner of Patents and Trademarks 
to be entitled thereto according to 35 USC §122 and the Commissioner's rules pursuant thereto (including 37 
CFR §1.14 with particular reference to 886 OG 638). 

The assignee of the present application has agreed that if a culture of the materials on deposit shoulc 
die or be lost or destroyed when cultivated under suitable conditions, the materials will be promptly replaced 
on notification with another of the same. Availability of the deposited material is not to be construed as a 
license to practice the invention in contravention of the rights granted under the authority of any government 
in accordance with its patent laws. 

The foregoing written specification is considered to be sufficient to enable one skilled in the art to 
practice the invention. The present invention is not to be limited in scope by the construct deposited, since 
the deposited emoodiment is intended as a single illustration of certain aspects of the invention and any 
constructs that are functionally equivalent are within the scope of this invention. The deposit of material 
herein does not constitute an admission that the written description herein contained is inadequate to enable 
the practice of any aspect of the invention, including the best mode thereof, nor is it to be construed as 
limiting the scope of the claims to the specific illustrations that it represents. Indeed, various modifications 
of the invention in addition to those shown and described herein will become apparent to those skilled in the 
art from the foregoing description and fall within the scope of the appended claims. 
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baseline A 280 with loading buffer, a. which point fraction collection is started. Next, the column is washed 
with a secondary wash buffer (50 mM phosphate; 300 mM NaCl. 10% Glycerol. P H 6.0). which elutes 
nonspecifically bound protein. After reaching A 280 baseline again, the column is developed with a 0 to 500 
mM Imidazole gradient in the secondary wash buffer. One mL fractions are collected and analyzed by SDS- 
5 PAGE and silver staining or western blot with Ni 2+ -NTA-conjugated to alkaline phosphatase (Qiagen). 
Fractions containing the eluted His 10 -tagged vertebrate fused are pooled and dialyzed against loading buffer. 
Alternatively, purification of the IgG tagged (or Fc tagged) vertebrate /.W can be performed using known 
chromatography techniques, including for instance. Protein A or protein G column chromatography 

EXAMPLE 15 

)0 Preparation of Antibodies that Bind Vertebrate fuse d 

This example illustrates preparation of monoclonal antibodies, which can specifically bind 
vertebrate fused. 

Techniques for producing the monoclonal ant.bod.es are known in the art and are described, for 
instance, in Coding, supra. Immunogens that may be employed include purified vertebrate/^, fusion 

15 proteins containing vertebrate fused, and cells expressing recombinant vertebrate/^ on the cell surface. 
Selection of the immunogen can be made by the skilled artisan without undue experimentation. 

Mice, such ks Balb/c, are immunized with the vertebrate/^ immunogen emulsified in 
complete Freund's adjuvant and injected subcutaneously or intraper.toneally in an amount from 1-100 
micrograms. Alternatively, the immunogen is emulsified .n MPL-TDM adjuvant (Ribi Immunochemical 

20 Research, Hamilton, MT) and injected into the animals hind foot pads. The immun.zed mice are then 
boosted 10 to 12 days later with additional immunogen emulsified in the selected adjuvant. Thereafter, for 
several weeks, the mice may also be boosted with additional immunization injections. Serum samples may 
be periodically obtained from the mice by retro-orbital bleeding for testing .n EL1SA assays to detect 
vertebrate fused antibodies. 

25 After a suitable antibody titer has been detected, the animals "positive" for antibodies can 

be injected with a final intravenous injection of vertebrate fused. Three to four days later, the mice are 
sacrificed and the spleen cells are harvested. The spleen cells are then fused (using 35% polyethylene glycol) 
to a selected murine myeloma cell line such as P3X63AgU.t, available from ATCC, No. CRL 1597. The 
fusions generate hybridoma cells which can then be plated in 96 well tissue culture plates containing HAT 

30 (hypoxanthine. aminopterin, and thymidine) medium to inhibit proliferation of non-fused cells, myeloma 

hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened in an EL1SA for reactivity against vertebrate fused. 
Determination of "positive" hybridoma cells secreting the desired monoclonal antibodies against vertebrate 

fused is within the skill in the art. 
35 The positive hybr.doma ceils can be injected intraper.toneally into syngeneic Balb/c mice to 

produce ascites containing the ami- vertebrate fused monoclonal antibodies. Alternatively, the hybridoma 
cells can be grown in tissue culture flasks or roller bottles. Purification of the monoclonal antibodies 
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plasmid, together with DNA encoding the ADH2/GAPDH promoter, the yeast alpha-factor secretory 
signal/leader sequence, and linker sequences (if needed) for expression of vertebrate fused. 

Yeast cells, such as yeast strain AB110. can then be transformed with the expression piasmids 
described above and cultured in selected fermentation media. The transformed yeast supematants can be 
analyzed by precipitation with 10% trichloroacetic acid and separation by SDS-PAGE. followed by staining 
of the gels with Coomassie Blue stain. 

Recombinant vertebrate fused can subsequently be isolated and purified by removing the yeast cells 
from the fermentation medium by centrifugation and then concentrating the medium using selected cartridge 
filters. The concentrate containing vertebrate fused may further be purified using selected column 
chromatography resins. 

EXAMPLE 14 

Expression of vertebrate fused in Baculovirus-lnfected Insect Cells ' 
The following method describes recombinant expression of vertebrate /i«ec/ in Baculovirus-infected 
insect cells. 

The vertebrate fused is fused upstream of an epitope tag contained within a baculovirus expression 
vector. Such epitope tags include poly-his tags and immunoglobulin tags (like Fc regions of IgG). A variety 
of piasmids may be employed, including piasmids derived from commercially available piasmids such as 
pVL1393 (Novagen). Briefly, the venebnte fused or the desired portion of the vertebrate fused(such as the 
sequence encoding the extracellular domain of a transmembrane protein) is amplified by PCR with primers 
complementary to the 5' and 3' regions. The 5' primer may incorporate flanking (selected) restriction enzyme 
sites. The product is then digested with those selected restriction enzymes and subcloned into the expression 
vector. 

Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGold™ virus 
DNA (Pharmingen) into Spodoptera frugiperda ("Sf9") cells (ATCC CRL 1711) using lipofectin 
(commercially available from G1BCO-BRL). After 4-5 days of incubation at 28°C. the released viruses are 
harvested and used for further amplifications. Viral infection and protein expression is performed as 
described by O'Reilley et al.. Baculovirus expression vectors: A Laboratory Manual, Oxford: Oxford 
University Press (1994). 

Expressed poly-his tagged vertebrate fused can then be purified, for example, by Ni 2+ -chelate 

affinity chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as 
described by Rupert et al., Nature . 362:175-179 (1993). Briefly, Sf9 cells are washed, resuspended in 
sonication buffer (25 mL Hepes, pH 7.9: 12.5 mM MgCl 2 ; 0.1 mM EDTA; 10% Glycerol; 0.1% NP-40: 0.4 

M KC1), and sonicated twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the 
supernatant is diluted 50-fold in loading buffer (50 mM phosphate. 300 mM NaCI. 10% Glycerol, pH 7.8) 
and filtered through a 0.45 um filter. A Ni 2+ -NTA agarose column (commercially available from Qiagen) is 

prepared with a bed volume of 5 mL, washed with 25 mL of water and equilibrated with 25 mL of loading 
buffer. The filtered cell extract is loaded onto the column at 0.5 mL per minute. The column is washed to 
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Approximately 24 hours after the transfections. the culture medium is removed and replaced with 
culture medium (alone) or culture medium containing 200 uCi/ml j3 S-cysteine and 200 uCi/ml j5 S- 
methionine. After a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, 
and loaded onto a 15% SDS gei. The processed gel may be dried and exposed to film for a selected period of 
time to reveal the presence of vertebrate fused polypeptide. The cultures containing transfected cells may 
underso further incubation (in serum free medium) and the medium is tested in selected bioassays. 

in an alternative technique, vertebrate fused may be introduced into 293 cells transiently using the 
dextran sulfate method described by Somparyrac el a!.. Proc. Natl. Acad. ScL. 12:7575 (1981). 293 cells are 
grown to maximal density in a spinner flask and 700 ug pRK5-fused DNA is added. The cells are first 
concentrated from the spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate is 
incubated on the cell pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed 
with tissue culture medium, and re-introduced into the spinner flask containing tissue culture medium, 5 
ua/ml bovine insulin and 0.1 ug/ml bovine transferrin. After about four days, the conditioned media is 
centrifuged and filtered to remove cells and debris. The sample containing expressed vertebrate fused can 
1 5 then be concentrated and purified by any selected method, such as dialysis and/or column chromatography. 

In another embodiment, vertebrate fused can be expressed in CHO cells. The pSVi-fused can be 
transfected into CHO cells using known reagents such as CaP0 4 or DEAE-dextran. As described above, the 
ceil cultures can be incubated, and the medium replace j with culture medium (alone) or medium containing a 
radiolabel such as 35 S-methionine. After determining the presence of vertebrate fused polypeptide, the 
20 culture medium may be replaced with serum free medium. Preferably, the cultures are incubated for about 6 
days, and then the conditioned medium is harvested. The medium containing the expressed vertebrate fused 
can then be concentrated and purified by any selected method. 

Epitope-tagged vertebrate fused may also be expressed in host CHO cells. The vertebrate fused may 
be subcloned out of the pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected 
25 epitope tag such as a poly-his tag into an expression vector. The poly-his tagged vertebrate fused insert can 
then be subcloned into a SV40 driven vector containing a selection marker such as DHFR for selection of 
stable clones. Finally, the CHO cells can be transfected (as described above) with the SV40 driven vector. 
Labeling may be performed, as described above, to verify expression. The culture medium containing the 
expressed poly-His tagged vertebrate fused can then be concentrated and purified by any selected method, 
30 such as by Ni 2+ -chelate affinity chromatography. 

EXAMPLE 13 
Expression of vertebrate fused in Yeast 
The following method describes recombinant expression of vertebrate /wsetf in yeast. 
First, yeast expression vectors are constructed for intracellular production or secretion of vertebrate 
35 fused from the ADH2/GAPDH promoter. DNA encoding vertebrate fused, a selected signal peptide and the 
promoter is inserted into suitable restriction enzyme sites in the selected plasmid to direct intracellular 
expression of vertebrate fused. For secretion. DNA encoding vertebrate fused can be cloned into the selected 
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Gly Ser Glu Phe Leu Pro Val Val Val Leu Ser Val Cys 
975 980 985 



cag etc ctt tgc ttc ccc ttt gcg ctg gac atg gat get 3154 
Gin Leu Leu Cys Phe Pro Phe Ala Leu Asp Met Asp Ala 
990 SS5 



gac etc ctt ata gtt gtc ttg gec gac etc agg gac tea 3193 
Asp Leu Leu lie Val Val Leu Ala Asp Leu Arg Asp Ser 
1000 1005 1010 



gaa gtt gca gee cat ctg ctg cag gtc tgc tgc tac cat 3232 
10 Glu Val Ala Ala His Leu Leu Gin Val Cys Cys Tyr His 
1015 1020 



ctt ccg ttg atg caa gtg gag ctg ccc ate age ctt etc 3271 
Leu Pro Leu Met Gin Val Glu Leu Pro lie Ser Leu Leu 
1025 1030 1035 



15 aca cgc ctg gec etc atg gat ccc acc tct etc aac cag 3310 
Thr Arg Leu Ala Leu Met Asp Pro Thr Ser Leu Asn Gin 
1040 1045 1050 



ttt gtg aac aca gtg tct gec tec cct aga acc ate gtc 3349 
Phe Val Asn Thr Val Ser Ala Ser Pro Arg Thr lie Val 
20 1055 1060 



teg ttt etc tea gtt gee etc ctg agt gac cag cca ctg 3388 
Ser Phe Leu Ser Val Ala Leu Leu Ser Asp Gin Pro Leu 
1065 1070 1075 



ttg acc tec gac 
25 Leu Thr Ser Asp 
1080 



ctt etc tct ctg ctg 
Leu Leu Ser Leu Leu 
1085 



gee cat act gec 3427 
Ala His Thr Ala 



agg gtc ctg tct ccc age cac ttg tec ttt ate caa gag 3466 
Arg Val Leu Ser Pro Ser His Leu Ser Phe lie Gin Glu 
1090 1095 1100 



30 ctt ctg get ggc tct gat gaa tec tat egg ccc ctg cgc 3505 

Leu Leu Ala Gly Ser Asp Glu Ser Tyr Arg Pro Leu Arg 
1105 1110 1115 

age etc ctg ggc cac cca gag aat tct gtg egg gca cac 3544 

Ser Leu Leu Gly His Pro Glu Asn Ser Val Arg Ala His 
35 1120 1125 



act tat agg etc ctg gga cac ttg etc caa cac age atg 3583 
Thr Tyr Arg Leu Leu Gly His Leu Leu Gin His Ser Met 
1130 1135 1140 



gee ctg cgt ggg gca ctg cag age cag tct gga ctg etc 3622 
4 0 Ala Leu Arg Gly Ala Leu Gin Ser Gin Ser Gly Leu Leu 
. 1145 . 1150. 



age ctt ctg ctg ctt ggg ctt gga gac aag gat cct gtt 3661 
Ser Leu Leu Leu Leu Gly Leu Gly Asp Lys Asp Pro Val 
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1155 1160 1165 

gtg egg tgc agt gec age ttt get gtg ggc aat gca gec 3700 
Val Arg Cys Ser Ala Ser Phe Ala Val Gly Asn Ala Ala 
1170 1175 1180 

5 tac cag get ggt cct ctg gga cct gec ctg gca get gca 3739 
Tyr Gin Ala Gly Pro Leu Gly Pro Ala Leu Ala Ala Ala 
1185 1190 

gtg ccc agt atg ace cag ctg ctt gga gat cct cag get 3778 
Val Pro Ser Met Thr Gin Leu Leu Gly Asp Pro Gin Ala 
10 1195 1200 1205 

ggt ate egg cgc aat gtt gca tea get ctg ggc aac ttg 3817 
Gly lie Arg Arg Asn Val Ala Ser Ala Leu Gly Asn Leu 
1210 1215 

gga cct gaa ggt ttg gga gag gag ctg tta cag tgc gaa 3856 
15 Gly Pro Glu Gly Leu Gly Glu Glu Leu Leu Gin Cys Glu 
1220 1225 1230 

gta ccc cag egg etc eta gaa atg gca tgt gga gac ccc 3895 
Val Pro Gin Arg Leu Leu Glu Met Ala Cys Gly Asp Pro 
1235 1240 1245 

20 cag cca aat gtg aag gag get gec etc att gee etc egg 3934 
Gin Pro Asn Val Lys Glu Ala Ala Leu lie Ala Leu Arg 
1250 1255 

age ctg caa cag gag cct ggc ate cat cag gta ctg gtg 3 973 
Ser Leu Gin Gin Glu Pro Gly lie His Gin Val Leu Val 
25 1260 1265 1270 

tec ctg ggt gee agt gag aaa eta tec ttg etc tct ctg 4 012 
Ser Leu Gly Ala Ser Glu Lys Leu Ser Leu Leu Ser Leu 
1275 1280 

ggg aat cag tea ctg cca cac age agt cct agg cct gec 4 051 
3 0 Gly Asn Gin Ser Leu Pro His Ser Ser Pro Arg Pro Ala 
1285 1290 1295 

tct gec aaa cac tgc agg aaa etc att cac etc ctg agg 4 090 
Ser Ala Lys His Cys Arg Lys Leu lie His Leu Leu Arg 
1300 1305 1310 

35 cca gec cat age atg tgatt ccagattcct gcggtccagc 4130 
Pro Ala His Ser Met 
1315 

ctccaacttt ggtgccagct ctttcttatn taatacacaa gcgccaaytc 4180 
aactgagagc taaagagact agaaaagaga taagctgeca actcaactga 4230 
40 gaacaggaaa ctngaagaga tttatatata aagcttcttc cttctcccag 4280 
atgcaggatg ttttcaacca gtaaatttta ttgctgttgg tgccagagaa 4330 
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gagtcccttt cttctctaca tccaggggcc nttttctcca ataatgtgcc 4380 

tttaactcta gggacctgcc tcacggacct tagggaaaaa cctcaacctg 4430 

aaagatctct tcctttctgg agctccttta atcttcccag caggtttttg 4480 

ccttagacgt gctggcccca ggacagtgat gaagacagag cctgtctcag 4530 

5 ctctaggctg tggggatcaa tgccatcagt ccctgttatt gagggattat 4580 

cccttagcca acattcctat ctgtgggtgg gcgtggagag tgtatctttt 4630 

tttggggtgt gtgtgtatat gtgtgtgtgt atgtgtgtgt gtgtttaata 4680 

gttctgtttg taaactcttt taataaaagt tgtgcctcac catacttgaa 4730 

gctcccagga caagggttga gaggctcaac ccctctttca gcttctatgt 4780 

10 ggtgttggag gtgctggtat cgtgttcaca caaaaaaaaa aaaaaaaaaa 4830 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4880 

<210> 2 
<211> 1315 
<212> PRT 
15 <213> Homo sapiens 

<400> 2 

Met Glu Lys Tyr Kis Val Leu Glu Met lie Gly Glu Gly Ser Phe 
15 10 15 

Gly Arg Val Tyr Lys Gly Arg Arg Lys Tyr Ser Ala Gin Val Val 
20 20 25 30 

Ala Leu Lys Phe lie Pro Lys Leu Gly Arg Ser Glu Lys Glu Leu 
35 40 45 

Arg Asn Leu Gin Arg Glu lie Glu lie Met Arg Gly Leu Arg His 
50 55 60 

25 Pro Asn He Val His Met Leu Asp Ser Phe Glu Thx Asp Lys Glu 

65 70 75 

Val Val Val Val Thr Asp Tyr Ala Glu Gly Glu Leu Phe Gin He 
80 85 90 

Leu Glu Asp Asp Gly Lys Leu Pro Glu Asp Gin Val Gin Ala He 
30 95 100 105 

Ala Ala Gin Leu Val Ser Ala Leu Tyr Tyr Leu His Ser His Arg 
110 115 120 

He Leu His Arg Asp Met Lys Pro Gin Asn He Leu Leu Ala Lys 
125 130 135 

35 Gly Gly Gly He Lys Leu Cys Asp Phe Gly Phe Ala Arg Ala Met 

140 145 150 
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Ser Thr Asn Thr Met Val Leu Thr Ser lie Lys Gly Thr Pro Leu 
155 160 165 

Tyr Met Ser Pro Glu Leu Val Glu Glu Arg Pro Tyr Asp Kis Thr 
170 175 180 

Ala Asp Leu Trp Ser Val Gly Cys lie Leu Tyr Glu Leu Ala Val 
185 190 195 

Gly Thr Pro Pro Phe Tyr Ala Thr Ser lie Phe Gin Leu Val Ser 
200 205 210 

Leu lie Leu Lys Asp Pro Val Arg Trp Pro Ser Thr lie Ser Pro 
215 220 225 

Cys Phe Lys Asn Phe Leu Gin Gly Leu Leu Thr Lys Asp Pro Arg 
230 235 240 

Gin Arg Leu Ser Trp Pro Asp Leu Leu Tyr His Pro Phe lie Ala 
245 250 255 

Gly His Val Thr lie He Thr Glu Pro Ala Gly Pro Asp Leu Gly 
260 265 270 

Thr Pro Phe Thr Ser Arg Leu Pro Pro Glu Leu Gin Val Leu Lys 
275 280 285 

Asp Glu Gin Ala His Arg Leu Ala Pro Lys Gly Asn Gin Ser Arg 
290 295 300 

He Leu Thr Gin Ala Tyr Lys Arg Met Ala Glu Glu Ala Met Gin 
305 310 315 

Lys Lys His Gin Asn Thr Gly Pro Ala Leu Glu Gin Glu Asp Lys 
320 325 330 

Thr Ser Lys Val Ala Pro Gly Thr Ala Pro Leu Pro Arg Leu Gly 
335 340 345 

Ala Thr Pro Gin Glu Ser Ser Leu Leu Ala Gly He Leu Ala Ser 
350 355 360 

Glu Leu Lys Ser Ser Trp Ala Lys Ser Gly Thr Gly Glu Val Pro 
365 370 375 

Ser Ala Pro Arg Glu Asn Arg Thr Thr Pro Asp Cys Glu Arg Ala 
380 385 390 

Phe Pro Glu Glu Arg Pro Glu Val Leu Gly Gin Arg Ser Thr Asp 
395 400 405 

Val Val Asp Leu Glu Asn Glu Glu Pro Asp Ser Asp Asn Glu Trp 
410 415 420 

Gin His Leu Leu Glu Thr Thr Glu Pro Val Pro He Gin Leu Lys 
425 430 435 
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Ala Pro Leu Thr Leu Leu Cys Asn Pro Asp Phe Cys Gin Arg He 
440 445 450 

Gin Ser Gin Leu His Glu Ala Gly Gly Gin He Leu Lys Gly He 
455 -460 465 

Leu Glu Gly Ala Ser His He Leu Pro Ala Phe Arg Val Leu Ser 
470 475 480 

Ser Leu Leu Ser Ser Cys Ser Asp Ser Val Ala Leu Tyr Ser Phe 
485 490 495 

Cys Arg Glu Ala Gly Leu Pro Gly Leu Leu Leu Ser Leu Leu Arg 
500 505 510 

His Ser Gin Glu Ser Asn Ser Leu Gin Gin Gin Ser Trp Tyr Gly 
515 520 525 

Thr Phe Leu Gin Asp Leu Met Ala Val He Gin Ala Tyr Phe Ala 
530 535 540 

Cys Thr Phe Asn Leu Glu Arg Ser Gin Thr Ser Asp Ser Leu Gin 
545 550 555 

Val Phe Gin Glu Ala Ala Asn Leu Phe Leu Asp Leu Leu Gly Lys 
560 565 r jl0 

Leu Leu Ala Gin Pro Asp Asp Ser Glu Gin Thr Leu Arg Arg Asp 
575 580 585 

Ser Leu Met Cys Phe Thr Val Leu Cys Glu Ala Met Asp Gly Asn 
590 595 600 

S-zr Arg Ala He Ser Lys Ala Phe Tyr Ser Ser Leu Leu Thr Thr 
605 610 615 

Gin Gin Val Val Leu Asp Gly Leu Leu His Gly Leu Thr Val Pro 
620 625 630 

Gin Leu Pro Val His Thr Pro Gin Gly Ala Pro Gin Val Ser Gin 
635 640 645 

Pro Leu Arg Glu Gin Ser Glu Asp He Pro Gly Ala He Ser Ser 
650 655 660 

Ala Leu Ala Ala He Cys Thr Ala Pro Val Gly Leu Pro Asp Cys 
665 670 675 

Trp Asp Ala Lys Glu Gin Val Cys Trp His Leu Ala Asn Gin Leu 
680 685 690 

Thr Glu Asp Ser Ser Gin Leu Arg Pro Ser Leu He Ser Gly Leu 
695 700 705 

Gin His Pro He Leu Cys Leu His Leu Leu Lys Val Leu Tyr Ser 
710 715 720 
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Cys Cys Leu Val Ser Glu Gly Leu Cys Arg Leu Leu Gly Gin Glu 
725 730 735 

Pro Leu Ala Leu Glu Ser Leu Phe Met Leu He Gin Gly Lys Val 
740 745 750 

5 Lys Val Val Asp Trp Glu Glu Ser Thr Glu Val Thr Leu Tyr Phe 



755 



760 



Leu 



Ser Leu Leu Val Phe Arg Leu Gin Asn Leu Pro Cys Gly Met 



770 



775 780 



Glu Lys Leu Gly Ser Asp Val Ala Thr Leu Phe Thr His Ser His 
10 ' 7 85 



790 795 



Val Val Ser Leu Val Ser Ala Ala Ala Cys Leu Leu Gly Gin Leu 
800 805 810 

Gly Gin Gin Gly Val Thr Phe Asp Leu Gin Pro Met Glu Trp Met 
815 820 825 

15 Ala Ala Ala Thr His Ala Leu Ser Ala Pro Ala Glu Val Arg Leu 

830 835 840 



Thr Pro Pro Gly Ser Cys Gly Phe Tyr Asp Gly Leu Leu He Leu 
845 



850 855 



Leu Leu Gin Leu Leu Thr Glu Gin Gly Lys Ala Ser Leu He Arg 
20 860 



865 870 



Asp Met Ser Ser Ser Glu Met Trp Thr Val Leu Trp His Arg Phe 
875 880 885 

Ser Met Val Leu Arg Leu Pro Glu Glu Ala Ser Ala Gin -llu Gly 
890 895 90° 

25 Glu Leu Ser Leu Ser Ser Pro Pro Ser Pro Glu Pro Asp Trp Thr 

905 910 915 

Leu lie Ser Pro Gin Gly Met Ala Ala Leu Leu Ser Leu Ala Met 
920 925 930 

Ala Thr Phe Thr Gin Glu Pro Gin Leu Cys Leu Ser Cys Leu Ser 
30 935 940 945 

Gin His Gly Ser He Leu Met Ser He Leu Lys His Leu Leu Cys 
950 955 960 

Pro Ser Phe Leu Asn Gin Leu Arg Gin Ala Pro His Gly Ser Glu 
965 970 975 

35 Phe Leu Pro Val Val Val Leu Ser Val Cys Gin Leu Leu Cys Phe 

980 985 990 



Pro Phe Ala Leu Asp Met Asp Ala Asp Leu Leu He Val Val Leu 
995 1000 10° 5 
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Ala Asp Leu Arg Asp Ser Glu Val Ala Ala His Leu Leu Gin Val 
1010 1015 1020 

Cys Cys Tyr His Leu Pro Leu Met Gin Val Glu Leu Pro lie Ser 
1025 1030 1035 

Leu Leu Thr Arg Leu Ala Leu Met Asp Pro Thr Ser Leu Asn Gin 
1040 1045 1050 

Phe Val Asn Thr Val Ser Ala Ser Pro Arg Thr lie Val Ser Phe 
1055 1060 1065 

Leu Ser Val Ala Leu Leu Ser Asp Gin Pro Leu Leu Thr Ser Asp 
1070 1075 1080 

Leu Leu Ser Leu Leu Ala His Thr Ala Arg Val Leu Ser Pro Ser 
1085 1090 1095 

His Leu Ser Phe lie Gin Glu Leu Leu Ala Gly Ser Asp Glu Ser 
1100 1105 mo 

Tyr Arg Pro Leu Arg Ser Leu Leu Gly His Pro Glu Asn Ser Val 
1115 1120 H25 

Arg Ala His Thr Tyr Arg Leu Leu Gly His Leu Leu Gin His Ser 
1130 1135 1140 

Met Ala Leu Arg Gly Ala Leu Gin Ser Gin Ser Gly Leu Leu Ser 
1145 1150 H55 

Leu Leu Leu Leu Gly Leu Gly Asp Lys Asp Pro Val Val Arg Cys 
1160 H65 1170 

Ser Ala Ser Phe Ala Val Gly Asn Ala Ala Tyr Gin Ala Gly Pro 
1175 1180 1185 

Leu Gly Pro Ala Leu Ala Ala Ala Val Pro Ser Met Thr Gin Leu 
1190 1195 1200 

Leu Gly Asp Pro Gin Ala Gly lie Arg Arg Asn Val Ala Ser Ala 
1205 1210 1215 

Leu Gly Asn Leu Gly Pro Glu Gly Leu Gly Glu Glu Leu Leu Gin 
1220 1225 1230 

Cys Glu Val Pro Gin Arg Leu Leu Glu Met Ala Cys Gly Asp Pro 
1235 1240 1245 

Gin Pro Asn Val Lys Glu Ala Ala Leu He Ala Leu Arg Ser Leu 
1250 1255 1260 

Gin Gin Glu Pro Gly He His Gin Val Leu Val Ser Leu Gly Ala 
1265 1270 1275 



Ser Glu Lys Leu Ser Leu Leu Ser Leu Gly Asn Gin Ser Leu Pro 
1280 1285 1290 
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His Ser Ser Pro Arg Pro Ala Ser Ala Lys His Cys Arg Lys Leu 
1295 1300 1305 

He His Leu Leu Arg Pro Ala His Ser Met 
1310 1315 

5 <210> 3 
<211> 193 
<212> DNA 

<213> Homo sapiens 
<400> 3 

10 cccgggctat gagcaccaat acaatggtgc tgacatccat caaaggcaca 50 
ccactctata tgtctccaga gctggtggag gagcgaccat acgaccacac 100 
agcggacctc tggtctgttg gctgcatact atatgaactg gcagtaggca 150 
cccctccctt ctaatgctac aagcatcttt cagctggtca gcc 193 

<210> 4 
15 <211> 5125 
<212> DNA 

<213> Homo sapiens 
<400> 4 

cccacgcgtc cgcccacgcg tccggggcgt cccagatgtt gtggaactgt 50 

20 ccctggatct atagctcttc accgtctcta ctttcttcct tctaagagat 100 

cctgaaacct ctgtc atg gaa aag tac cac gtg ttg gag 139 

Met Glu Lys Tyr His Val Leu Glu 
1 5 

atg att gga gaa ggc tct ttt ggg agg gtg tac aag ggt 17 8 
25 Met He Gly Glu Gly Ser Phe Gly Arg Val Tyr Lys Gly 
10 " 20 

cga aga aaa tac agt get cag gtc gtg gcc ctg aag ttc 217 
Arg Arg Lys Tyr Ser Ala Gin Val Val Ala Leu Lys Phe 
25 30 

30 ate cca aaa ttg ggg cgc tea gag aag gag ctg agg aat 256 
He Pro Lys Leu Gly Arg Ser Glu Lys Glu Leu Arg Asn 
35 40 45 

ttg caa cga gag att gaa ata atg egg ggt ctg egg cat 295 
Leu Gin Arg Glu He Glu He Met Arg Gly Leu Arg His 
35 50 55 60 

ccc aac att gtg cat atg ctt gac age ttt gaa act gat 334 
Pro Asn He Val His Met Leu Asp Ser Phe Glu Thr Asp 
65 70 



40 



aaa gag gtg gtg gtg gtg aca gac tat get gag gga gag 373 
Lys Glu Val Val Val Val Thr Asp Tyr Ala Glu Gly Glu 
75 80 85 
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etc ttt cag ate eta gaa gat gac gga aaa ctt cct gaa 412 
Lau Phe Gin He Leu Glu Asp Asp Gly Lys Leu Pro Glu 
90 95 

gac cag gtt cag gec att get gec cag ttg gtg tea gec 451 
Asp Gin Val Gin Ala He Ala Ala Gin Leu Val Ser Ala 
100 105 no 

ctg tac tat ctg cat tec cac cgc ate eta cac cga gat 490 
Leu Tyr Tyr Leu His Ser His Arg He Leu His Arg Asp 
H5 120 125 

atg aag cct cag aac ate etc etc gec aag ggt ggt ggc 529 
Met Lys Pro Gin Asn He Leu Leu Ala Lys Gly Gly Gly 
130 135 

ate aag etc tgt gac ttt gga ttt gec egg get atg age 568 
He Lys Leu Cys Asp Phe Gly Phe Ala Arg Ala Met Ser 



15 140 145 



150 



ace aat aca atg gtg ctg aca tec ate aaa ggc aca cca 607 
Thr Asn Thr Met Val Leu Thr Ser He Lys Gly Thr Pro 
155 160 

etc tat atg tct cca gag ctg gtg gag gag cga cca tac 646 
Leu Tyr Met Ser Pro Glu Leu Val Glu Glu Arg Pro Tyr 
165 170 i7 5 

gac cac aca gcg gac etc tag tct gtt ggc tgc ata eta 685 
Asp His Thr Ala Asp Leu Trp Ser Val Gly Cys He Leu 



180 185 



190 



tat gaa ctg gca gta ggc ace cct ccc ttc tat get aca 724 
Tyr Glu Leu Ala Val Gly Thr Pro Pro Phi Tyr Ala Thr 
195 200 

age ate ttt cag ctg gtc age etc att etc aag gac cct 763 
Ser He Phe Gin Leu Val Ser Leu He Leu Lv S Asp Pro 
30 205 210 



215 



gtg cgc tgg ccc tea acc ate agt ccc tgc ttt aag aac 
Val Arg Trp Pro Ser Thr He Ser Pro Cys Phe Lys Asn 
220 225 



802 



ttc ctg cag gga ctg etc acc aaa gac cca egg cag cga 841 
Phe Leu Gin Gly Leu Leu Thr Lys Asp Pro Arg Gin Arg 



230 235 



240 



ctg tec tgg cca gac etc tta tat cac ccc ttt att get 
Leu Ser Trp Pro Asp Leu Leu Tyr His Pro Phe He Ala 
245 250 



880 

255 



ggt cat gtc acc ata ata act gag cca gca ggc cca gat 919 
Gly His Val Thr He He Thr Glu Pro Ala Gly Pro Asp 
260 265 

tfc 9 ggg acc cca ttc acc age cgc eta ccc cca gaa ctt 958 
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Leu Gly Thr Pro Phe Thr Ser Arg Leu Pro Pro Glu Leu 
270 275 280 

cag gtc eta aag gac gaa cag gec cat egg ttg gee ccc 997 
Gin Val Leu Lys Asp Glu Gin Ala His Arg Leu Ala Pro 
5 285 290 

aag ggt aat cag tct egc ate ttg act cag gee tat aaa 1036 
Lys Gly Asn Gin Ser Arg He Leu Thr Gin Ala Tyr Lys 
295 300 305 

egc atg get gag gag gee atg cag aag aaa cat cag aac 1075 
10 Arg Met Ala Glu Glu Ala Met Gin Lys Lys Hxs Gin Asn 
310 315 320 

aca gga cct gee ctt gag caa gag gac aag acc age aag 1114 
Thr Gly Pro Ala Leu Glu Gin Glu Asp Lys Thr Ser Lys 
325 330 

15 gtg get cct ggc aca gee cct ctg ccc aga etc ggg gec 1153 
Val Ala Pro Gly Thr Ala Pro Leu Pro Arg Leu Gly Ala 
335 340 345 

act cct cag gaa tea age etc ctg gec ggg ate tta gee 1192 
Thr Pro Gin Glu Ser Ser Leu Leu Ala Gly He Leu Ala 
20 350 355 

tea gaa ttg aag age age tgg get aaa tea ggg act gga 1231 
Ser Glu Leu Lys Ser Ser Trp Ala Lys Ser Gly Thr Gly 
360 ' 365 370 

gag gtg ccc tct gca cct egg gaa aac egg acc acc cca 1270 
25 Glu Val Pro Ser Ala Pro Arg Glu Asn Arg Thr Thr Pro 



30 



375 



380 385 



gat tgt gaa cga gca ttc cca gag gag agg cca gag gtg 1309 
Asp Cys Glu Arg Ala Phe Pro Glu Glu Arg Pro Glu Val 
390 395 

ctg ggc cag egg age act gat gta gtg gac ctg gaa aat 1348 
Leu Gly Gin Arg Ser Thr Asp Val Val Asp Leu Glu Asn 
400 405 410 ■ 

gag gag cca gac agt gac aat gag tgg cag cac ctg eta 1387 
Glu Glu Pro Asp Ser Asp Asn Glu Trp Gin His Leu .Leu 
35 415 420 

gag acc act gag cct gtg cct att caa ctg aag get cct 1426 
Glu Thr Thr Glu Pro Val Pro He Gin Leu Lys Ala Pro 
425 430 435 

etc ace ttg ctg tgt aat cct gac ttc tgc cag egc ate 1465 
4 0 Leu Thr Leu Leu Cys Asn Pro Asp Phe Cys Gin Arg He 

440 - 445 450 ... 

cag agt cag ctg cat gaa get gga ggg cag ate ctg aaa 1504 
Gin Ser Gin Leu His Glu Ala Gly Gly Gin He Leu Lys 
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ggc ate 
Gly lie 
465 

5 egg gtc 
Arg Val 



gtt gee 
Val Ala 
10 490 

999 ctg 
Gly Leu 



aac age 
15 Asn Ser 



cag gac 
Gin Asp 
530 

2 0 acc ttc 
Thr Phe 



cag gtg 
Gin Val 
25 555 

ttg ggg 
Leu Gly 



act ttg 
30 Thr Leu 



tgc gaa 
Cys Glu 
595 

35 gec ttt 
Ala Phe 



ttg gat 
Leu Asp 
40 620 

cct gtc 
Pro Val 



455 

ttg gag ggt 
Leu Glu Gly 



ctg age agt 
Leu Ser Ser 
480 

ttg tat tec 
Leu Tyr Ser 



ctg ctg agt 
Leu Leu Ser 
505 

etc cag cag 
Leu Gin Gin 
520 

ctg atg get 
Leu Met Ala 



aat ctg gag 
Asn Leu Glu 
545 

ttt cag gag 
Phe Gin Glu 



aaa ctg ctg 
Lys Leu Leu 
570 

cag agg gac 
Gin Arg Asp 
585 

gee atg gat 
Ala Met Asp 



tac tec age 
Tyr Ser Ser 
610 

999 ctc ctt 
Gly Leu Leu 



cac act ccc 
His Thr Pro 
635 



get tec cac 
Ala Ser His 
4 70 

ctt ctc tec 
Leu Leu Ser 



ttc tgc egg 
Phe Cys Arg 
495 

eta ctc agg 
Leu Leu Arg 
510 

caa tct tgg 
Gin Ser Trp 



gtg att cag 
Val He Gin 
535 

agg age cag 
Arg Ser Gin 



get gee aac 
Ala Ala Asn 
560 

gee caa cca 
Ala Gin Pro 
575 

age ctt atg 
Ser Leu Met 



ggg aac age 
Gly Asn Ser 
600 

ttg ctg acg 
Leu Leu Thr 



cat ggc ttg 
His Gly Leu 
625 

caa ggt aac 
Gin Gly Asn 
640 



460 

ate ctg cct 
He Leu Pro 



age tgc agt 
Ser Cys Ser 
485 

gag gca ggg 
Glu Ala Gly 
500 

cac agt cag 
His Ser Gin 



tat ggg acc 
Tyr Gly Thr 
525 

gec tac ttt 
Ala Tyr Phe 



aca agt gac 
Thr Ser Asp 
550 

ctt ttt ctg 
Leu Phe Leu 
565 

gat gac tct 
Asp Asp Ser 



tgc ttt act 
Cys Phe Thr 
590 

egg gee ate 
Arg Ala He 



aca cag cag 
Thr Gin Gin 
615 

aca gtt cca 
Thr Val Pro 
630 

cag agt gga 
Gin Ser Gly 



gca ttc 1543 
Ala Phe 
475 

gat tct 1582 
Asp Ser 



ctt cct 1621 
Leu Pro 



gag age 1660 
Glu Ser 
515 

ttc tta 1699 
Phe Leu 



gec tgt 1738 
Ala Cys 
540 

age ctg 1777 
Ser Leu 



gac ctg 1816 
Asp Leu 



gag cag 18 55 
Glu Gin 
580 

gtc ctg 1894 
Val Leu 



tec aaa 1933 
Ser Lys 
605 

gtt gtc 1972 
Val Val 



cag ctc 2011 
Gin Leu 



gaa ggg 2050 
Glu Gly 
645 
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agg ttc tct t gacttacttg ttgcataggt caggctccgc 2090 
Arg Phe Ser 
648 

tctttctatt gccatcacct agatcgcacc tggcatttag taggtgctca 2140 

5 ataaataact gtgaactgag agaatgaatg gggatctgag ggaaacaaac 2190 

agacctcatc ctgcattctt cccactccct taggttccct actcctgctg 2240 

ccatgtcggt gagtactggt gctattgtct agggcaagag cctcaggcct 2290 

ttgg agt tac tct ttg ctt ttc tec aca gga gec ccg 2327 
Ser Tyr Ser Leu Leu Phe Ser Thr Gly Ala Pro 
10 1 5 10 

caa gtg age cag cca ctg cga gag cag agt gag gat ata 2366 
Gin Val Ser Gin Pro Leu Arg Glu Gin Ser Glu Asp He 
15 20 

cct gga gec att tec tct gee ctg gca gec ata tgc act 2405 
15 Pro Gly Ala He Ser Ser Ala Leu Ala Ala He Cys Thr 
25 30 35 

get cct gtg gga ctg ccc gac tgc tgg gat gec aag gag 2444 
Ala Pro Val Gly Leu Pro Asp Cys Trp Asp Ala Lys Glu 
40 45 50 

20 cag gtc tgt tgg cat ttg gca aat cag eta act gaa gac 2483 
Gin Val Cys Trp His Leu Ala Asn Gin Leu Thr Glu Asp 
55 60 

age age cag etc agg cca tec etc ate tct ggc ctg cag 2522 
Ser Ser Gin Leu Aro~ Pro Ser Leu He Ser Gly Leu Gin 
25 65 70 75 

cat ccc ate ctg tgc ctg cac ctt etc aag gtt eta tac 2561 
His Pro He Leu Cys Leu His Leu Leu Lys Val Leu Tyr 
80 85 

tec tgc tgc ctt gtc agt gag ggc ctg tgc cgt ctt ctg 2600 
30 Ser Cys Cys Leu Val Ser Glu Gly Leu Cys Arg Leu Leu 
90 95 100 

ggg cag gag ccc ctg gec ttg gaa tec ctg ttt atg ttg 2639 
Gly Gin Glu Pro Leu Ala Leu Glu Ser Leu Phe Met Leu 
105 HO H5 

35 att cag ggc aag gta aaa gta gta gat tgg gaa gag tct 2678 
He Gin Gly Lys Val Lys Val Val Asp Trp Glu Glu Ser 
120 125 

act gaa gtg aca etc tac ttc etc tec ctt ctt gtc ttt 2717 
Thr Glu Val Thr Leu Tyr Phe Leu Ser Leu Leu Val Phe 
40 130 135 140 

egg etc caa aac ctg cct tgt gga atg gag aag eta ggc 2756 
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Arg Leu Gin Asn Leu Pro Cys Gly Met Glu Lys Leu Gly 
145 150 

agt gac gtt get act etc ttt acc cat teg cat gtc gtc 2795 
Ser Asp Val Ala Thr Leu Phe Thr His Ser His Val Val 



15 



25 



30 



40 



155 160 



165 



tct ctt gtg agt gca gca gec tgt eta ttg gga cag ctt 2834 
Ser Leu Val Ser Ala Ala Ala Cys Leu Leu Gly Gin Leu 
170 175 i8o 

ggt cag caa ggg gtg acc ttt gac etc cag ccc atg gaa 2873 
10 Gly Gin Gin Gly Val Thr Phe Asp Leu Gin Pro Met Glu 

185 190 

tgg atg get gca gec aca cat gec ttg tct gec cct gca 2912 
Trp Met Ala Ala Ala Thr His Ala Leu Ser Ala Pro Ala 
195 200 



205 



gag gtt egg ttg act cca cca ggt agt tgt gga ttc tat 2951 
Glu Val Arg Leu Thr Pro Pro Gly Ser Cys Gly Phe Tyr 
210 215 

gat ggc etc ctt ate ctt ctg ttg cag etc etc act gag 2990 
Asp Gly Leu Leu lie Leu Leu Leu Gin Leu Leu Thr Glu 
20 220 225 2 30 

cag ggg aag get age eta ate agg gat atg tec agt tea 3029 
Gin Gly Lys Ala Ser Leu lie Arg Asp Met Ser Ser Ser 
235 240 245 

gaa atg tgg acc gtt ttg tgg cac cgc ttc tec atg gtc 3068 
Glu Met Trp Thr Val Leu Trp His Arg Phe Ser Met Val 
250 255 

ctg agg etc ccc gag gag gca tct gca cag gaa ggg gag 3107 
Leu Arg Leu Pro Glu Glu Ala Ser Ala Gin Glu Gly Glu 
250 265 270 

ctt teg eta tec agt cca cca age cct gag cca gac tgg 3146 
Leu Ser Leu Ser Ser Pro Pro Ser Pro Glu Pro Asp Trp 
275 280 



aca ctg att tct ccc cag ggc atg gca gec ctg ctg age 3185 
Thr Leu lie Ser Pro Gin Gly Met Ala Ala Leu Leu Ser 
35 285 290 2 95 

ctg gec atg gec acc ttt acc cag gag ccc cag tta tgc 3224 
Leu Ala Met Ala Thr Phe Thr Gin Glu Pro Gin Leu Cys 
300 305 310 



ctg age tgc ctg tec cag cat gga agt ate etc atg tec 3263 
Leu Ser Cys Leu Ser Gin His Gly Ser He Leu Met Ser 
315 . 320 

ate ctg aag cat ctg ctt tgc ccc age ttc ctg aat caa 3302 
He Leu Lys His Leu Leu Cys Pro Ser Phe Leu Asn Gin 
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325 

ctg cgc 
Leu Arg 



5 gtg gtg 
Val Val 
350 

gcg ctg 
Ala Leu 

10 

gcc gac 
Ala Asp 



cag gtc 
15 Gin Val 
390 

ctg ccc 
Leu Pro 



20 ccc ccc 
Pro Thr 
415 

tec cct 
Ser Pro 

25 

ctg agt 
Leu Ser 



ctg ctg 
30 Leu Leu 
455 

ttg tec 
Leu Ser 



35 tec tat 
Ser Tyr 
480 

aat tct 
Asn Ser 

40 

ttg etc 
Leu Leu 



cag gcg cct 
Gin Ala Pro 
340 

etc tct gtc 
Leu Ser Val 



gac atg gat 
Asp Met Asp 
365 

etc agg gac 
Leu Arg Asp 
380 

tgc tgc tac 
Cys Cys Tyr 



ate age ctt 
lie Ser Leu 
405 

tct etc aac 
Ser Leu Asn 



aga acc ate 
Arg Thr lie 
430 

gac cag cca 
Asp Gin Pro 
445 

gcc cat act 
Ala His Thr 



ttt ate caa 
Phe He Gin 
470 

egg ccc ctg 
Arg Fro Leu 



gtg egg gca 
Val Arg Ala 
495 

caa cac age 
Gin His Ser 
510 



330 

cat ggg tct 
His Gly Ser 



tgc cag etc 
Cys Gin Leu 
355 

get gac etc 
Ala Asp Leu 
370 

tea gaa gtt 
Ser Glu Val 



cat ctt ccg 
His Leu Pro 
395 

etc aca cgc 
Leu Thr Arg 



cag ttt gtg 
Gin Phe Val 
420 

gtc teg ttt 
Val Ser Phe 
435 

ctg ttg acc 
Leu Leu Thr 



gcc agg gtc 
Ala Arg Val 
460 

gag ctt ctg 
Glu Leu Leu 



cgc age etc 
Arg Ser Leu 
485 

cac act tat 
His Thr Tyr 
500 

atg gcc ctg 
Met Ala Leu 



gag ttt etc 
Glu Phe Leu 
345 

ctt tgc ttc 
Leu Cys Phe 
360 

ctt ata gtt 
Leu He Val 



gca gcc cat 
Ala Ala His 
385 

ttg atg caa 
Leu Met Gin 



ctg gcc etc 
Leu Ala Leu 
410 

aac aca gtg 
Asn Thr Val 
425 

etc tea gtt 
Leu Ser Val 



tec gac ctt 
Ser Asp Leu 
450 

ctg tct ccc 
Leu Ser Pro 



get ggc tct 
Ala Gly Ser 
475 

ctg ggc cac 
Leu Gly His 
490 

agg etc ctg 
Arg Leu Leu 



cgt ggg gca 
Arg. Gly Ala 
515 
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335 

cct gtc 3341 
Pro Val 



ccc ttt 3380 
Pro Phe 



gtc ttg 3419 
Val Leu 
375 

ctg ctg 3458 
Leu Leu 



gtg gag 34 97 
Val Glu 
400 

atg gat 3536 
Met Asp 



tct gcc 3575 
Ser Ala 



gcc etc 3614 
Ala Leu 
440 

etc tct 3653 
Leu Ser 



age cac 3692 
Ser His 
465 

gat gaa 3731 
Asp Glu 



cca gag 3770 
Pro Glu 



gga cac 3809 
Gly His 
505 

ctg cag 3848 
Leu Gin 
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Ser Gin tit rT ^ ^ ^ Ct9 Ctg C " 399 ctt 3887 

Ser Gin Ser Gly Leu Leu Ser Leu Leu Leu Leu Gly Leu 



525 



530 



It IT ^ 9t9 C " tgC 33t gCC a 9 c "t 3926 

Gly Asp Lys Asp Pro Val Val Arg Cys Ser Ala Ser Phe 



535 540 



Si J 6 ? C39 9Ct g9t CCt ct * 9^ 3965 

Ala Val Gly Asn Ala Ala Tyr Gin Ala Gly Pro Leu Gly 

545 55 ° 555 

cct gcc ctg gca get gca gtg ccc agt atg acc cag ctg 4 004 
Pro Ala Leu Ala Ala Ala Val Pro Ser Met Thr Gin Leu 



560 565 



570 



ctt gga gat cct cag get ggt ate egg ege aat gtt gea 4043 
Leu Gly Asp Pro Gin Ala Gly He Arg Arg Asn Val Ala 
575 580 

tea get ctg gge aac ttg gga cct gaa ggt ttg gga gag 40 82 
oer Ala Leu Gly Asn Leu Gly Pro Glu Gly Leu Gly Glu 
585 59 ° 595 

-u t C ! 9 ^ t9C 933 9ta CCC Ca9 Cgg CtC cta ^a 4121 

"™ LSU LeU Gln ^ Glu Val Pro Gin Arg Leu Leu Glu 
600 



605 



atg gca tgt gga gac ccc cag cca aat gtg sag qaq act 4160 
Met Ala Cys Gly Asp Pro Gin Pro Asn Val Lys Glu Ala 
610 615 620 

gcc etc att gcc etc egg age ctg caa cag gag cct ggc 4199 
Ala Leu He Ala Leu Arg Ser Leu Gin Gin Glu Pro Gly 
625 63 ° 635 

ill His f ? T Ct9 9t ? Ct9 9gt 9CC agt ga 9 aaa 423 * 

He His Gin Val Leu Val Ser Leu Gly Ala Ser Glu Lys 

6 40 645 

eta tec ttg etc tct ctg ggg aat cag tea ctg cca cae 4277 
Leu Ser Leu Leu Ser Leu Gly Asn Gin Ser Leu Pro His 
650 65 5 660 

age agt cct agg cct gee tct gee aaa cae tgc agg aaa 4316 
Ser Ser Pro Arg Pro Ala Ser Ala Lys His Cys Arg Lys 
665 670 

etc att cae etc ctg agg cca gcc cat age atg t 4 350 
Leu He His Leu Leu Arg Pro Ala His Ser Met 
675 680 685 

gattccagat tectgeggte cagectccaa ctttggttgc eagctctttc 4400 
ttattctact acaeaagecg ccaacteaae tgagagctaa agagactaga 44 50 
aaagagataa getgecaaet caaetgagaa eaagaaacta gaagagattt 4 500 
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atatataaag cttcttcctt ctcccagatg caggatgttt tcaaccagta 4550 

aattttattg ctgttggtgc cagagaagag tcctttcttc tctacatcca 4600 

ggggcctttt ctccaataat gtgcctttaa ctctagggac ctgcctcacg 4650 

gaccttaggg aaaaacctca acctgaaaga tctcttcctt tctggagctc 47 00 

5 ctttaatctt cccagcaggt ttttgcctta gacgtgctgg ccccaggaca 4750 

gtgatgaaga cagagcctgt ctcagctcta ggctgtgggg atcaatgcca 48 00 

tcagtccctg ttattgaggg attatccctt agccaacatt cctatctgtg 4850 

ggtgggcgtg gagagtgtat ctttttttgg ggtgtgtgtg tatatgtgtg 4 900 

tgtgtatgtg tgtgtgtgtt taatagttct gtttgtaaac tcttttaata 4950 

10 aaagttgtgc ctcaccatac ttgaagctcc caggacaagg gttgagaggc 5000 

tcaacccctc tttcagcttc tatgtggtgt tggaggtgct ggtatcgtgt 5050 

tcacacaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 5100 

aaaaaaaaaa aaaaaaaaaa aaaaa 5125 

<210> 5 
15 <211> 648 
<2i2> FRT 

<213> Homo sapiens 
<400> 5 

Met Glu Lys Tyr His Val Leu Glu Met lie Gly Glu Gly Ser Phe 
-.0 1 5 10 15 

Gly Arg Val Tyr Lys Gly Arg Arg Lys Tyr Ser Ala Gin Val Val 
20 25 30 

Ala Leu Lys Phe He Pro Lys Leu Gly Arg Ser Glu Lys Glu Leu 
35 40 45 

25 Arg Asn Leu Gin Arg Glu lie Glu lie Met Arg Gly Leu Arg His 

50 55 60 

Pro Asn He Val His Met Leu Asp Ser Phe Glu Thr Asp Lys Glu 
65 70 75 

Val Val Val Val Thr Asp Tyr Ala Glu Gly Glu Leu Phe Gin lie 
30 ' 80 85 90 

Leu Glu Asp Asp Gly Lys Leu Pro Glu Asp Gin Val Gin Ala lie 
95 100 1° 5 

Ala Ala Gin Leu Val Ser Ala Leu Tyr Tyr Leu His Ser His Arg 
110 H5 120 

35 lie Leu His Arg Asp Met Lys Pro Gin Asn lie Leu Leu Ala Lys 
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125 130 135 

Gly Gly Gly lie Lys Leu Cys Asp Phe Gly Phe Ala Arg Ala Met 
140 145 150 

Ser Thr Asn Thr Met Val Leu Thr Ser He Lys Gly Thr Pro Leu 
5 155 160 165 

Tyr Met Ser Pro Glu Leu Val Glu Glu Arg Pro Tyr Asp His Thr 
170 175 180 

Ala Asp Leu Trp Ser Val Gly Cys He Leu Tyr Glu Leu Ala Val 
185 190 195 

10 Gly Thr Pro Pro Phe Tyr Ala Thr Ser He Phe Gin Leu Val Ser 

200 205 210 

Leu He Leu Lys Asp Pro Val Arg Trp Pro Ser Thr He Ser Pro 
215 220 225 

Cys Phe Lys Asn Phe Leu Gin Gly Leu Leu Thr Lys Asp Pro Arg 
15 230 235 240 

Gin Arg Leu Ser Trp Pro Asp Leu Leu Tyr His Pro Phe He Ala 
245 250 255 

Gly His Val Thr He lie Thr Glu Pro Ala Gl\ Pro Asp Leu Gly 
260 265 270 

20 Thr Pro Phe Thr Ser Arg Leu Pro Pro Glu Leu Gin Val Leu Lys 

275 280 285 

Asp Glu Gin Ala His Arg Leu Ala Pro Lys Gly Asn Gin Ser Arg 
290 295 300 

He Leu Thr Gin Ala Tyr Lys Arg Met Ala Glu Glu Ala Met Gin 
25 305 310 315 

Lys Lys His Gin Asn Thr Gly Pro Ala Leu Glu Gin Glu Asp Lys 
320 325 330 

Thr Ser Lys Val Ala Pro Gly Thr Ala Pro Leu Pro Arg Leu Gly 
335 340 345 

30 Ala Thr Pro Gin Glu Ser Ser Leu Leu Ala Gly He Leu Ala Ser 

350 355 360 

Glu Leu Lys Ser Ser Trp Ala Lys Ser Gly Thr Gly Glu Val Pro 
365 370 375 

Ser Ala Pro Arg Glu Asn Arg Thr Thr Pro Asp Cys Glu Arg Ala 
35 380 385 390 

Phe. Pro. Glu Glu Arg Pro Glu .Val Leu Gly Gin Arg Ser. Thr Asp 
395 400 405 

Val Val Asp Leu Glu Asn Glu Glu Pro Asp Ser Asp Asn Glu Trp 
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410 415 420 

Gin His Leu Leu Glu Thr Thr Glu Pro Val Pro lie Gin Leu Lys 
425 430 435 

Ala Pro Leu Thr Leu Leu Cys Asn Pro Asp Phe Cys Gin Arg lie 
5 440 445 450 

Gin Ser Gin Leu His Glu Ala Gly Gly Gin He Leu Lys Gly He 
455 460 465 

Leu Glu Gly Ala Ser His He Leu Pro Ala Phe Arg Val Leu Ser 
470 475 480 

10 Ser Leu Leu Ser Ser Cys Ser Asp Ser Val Ala Leu Tyr Ser Phe 

485 490 495 

Cys Arg Glu Ala Gly Leu Pro Gly Leu Leu Leu Ser Leu Leu Arg 
500 505 510 

His Ser Gin Glu Ser Asn Ser Leu Gin Gin Gin Ser Trp Tyr Gly 
15 515 520 525 

Thr Phe Leu Gin Asp Leu Met Ala Val He Gin Ala Tyr Phe Ala 
530 535 540 

Cys Thr Phe Asn Leu Glu Arg Ser Gin Thr Ser Asp Ser Leu Gin 
545 550 555 

20 Val Phe Gin Glu Ala Ala Asn Leu Phe Leu Asp Leu Leu Gly Lys 

560 565 570 

Leu Leu Ala Gin Pro Asp Asp Ser Glu Gin Thr Leu Gin Arg Asp 
575 580 585 

Ser Leu Met Cys Phe Thr Val Leu Cys Glu Ala Met Asp Gly Asn 
25 590 595 600 

Ser Arg Ala He Ser Lys Ala Phe Tyr Ser Ser Leu Leu Thr Thr 
605 610 615 

Gin Gin Val Val Leu Asp Gly Leu Leu His Gly Leu Thr Val Pro 
620 625 630 

30 Gin Leu Pro Val His Thr Pro Gin Gly Asn Gin Ser Gly Glu Gly 

635 640 645 

Arg Phe Ser 
648 

<210> 6 
35 <211> 5252 
<212> DNA 

<213> Homo sapiens. .. . . . .. 

<400> 6 

ggagcttgga gctcctaggc tgggggcgtc ccagatgttg tggaactgtc 50 
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cctggatcta tagctcttca ccgtctctac tttcttcctt ctaagagatc 100 

ctgaaacctc tgtc atg gaa aag tac cac gtg ttg gag atg 141 
Met Glu Lys Tyr His Val Leu Glu Met 
1 5 



att gga gaa ggc tct ttt ggg agg gtg tac aag ggt cga 180 
He Gly Glu Gly Ser Phe Gly Arg Val Tyr Lys Gly Arg 
10 15 20 



aga aaa tac agt get cag gtc gtg gec ctg aag ttc ate 219 
Arg Lys Tyr Ser Ala Gin Val Val Ala Leu Lys Phe He 

30 35 



10 25 



cca aaa ttg ggg C gc tea gag aag gag ctg agg aat ttg 258 
Pro Lys Leu Gly Arg Ser Glu Lys Glu Leu Arg Asn Leu 



40 



45 



15 



caa cga gag att gaa ata atg egg ggt ctg egg cat ccc 2 97 
Gin Arg Glu He Glu He Met Arg Gly Leu Arg His Pro 
50 55 60 

aac att gtg cat atg ctt gac age ttt gaa act gat aaa 336 
Asn He Val His Met Leu Asp Ser Phe Glu Thr Asp Lys 
6 5 7 0 

gag gtg gtg gtg gtg aca gac tat get gag gga gag etc 375 
Glu Val Val val Val Thr Asp Tyr Ala Glu Gly Glu Leu 
75 80 85 

ttt cag ate eta gaa gat gac gga aaa ctt cct gaa gac 414 
Phe Gin He Leu Glu Asp Asp Gly Lys Leu Pro Glu Asp 
25 90 



20 



30 



35 



95 



100 



cag gtt cag gee att get gee cag ttg gtg tea gee ctg 453 
Gin Val Gin Ala He Ala Ala Gin Leu Val Ser Ala Leu 
105 110 

tac tat ctg cat tec cac cgc ate eta cac cga gat atg 4 92 
Tyr Tyr Leu His Ser His Arg He Leu His Arg Asp Met 
115 120 125 

aag cct cag aac ate etc etc gee aag ggt ggt ggc ate 531 
Lys Pro Gin Asn He Leu Leu Ala Lys Gly Gly Gly He 
130 135 



aag etc tgt gac ttt gga ttt gec egg get atg age acc 570 
Lys Leu Cys Asp Phe Gly Phe Ala Arg Ala Met Ser Thr 
140 "5 150 

aat aca atg gtg ctg aca tec ate aaa ggc aca cca etc 609 
Asn Thr Met Val Leu Thr Ser He Lys Gly Thr Pro Leu 
40 155 160 165 



tat atg tct cca gag ctg gtg gag gag cga cca tac aac 64 8 
Tyr Met Ser Pro Glu Leu Val Glu Glu Arg Pro Tyr Asp 
170 175 
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cac aca gcg gac etc tgg tct gtt ggc tgc ata eta tat 687 
His Thr Ala Asp Leu Trp Ser Val Gly Cys lie Leu Tyr 
180 185 190 

gaa ctg gca gta ggc acc cct ccc ttc tat get aca age 726 

5 Glu Leu Ala Val Gly. Thr Pro Pro Phe Tyr Ala Thr Ser 
195 200 

ate ttt cag ctg gtc age etc att etc aag gac cct gtg 765 
He Phe Gin Leu Val Ser Leu He Leu Lys Asp Pro Val 
205 210 215 

10 cgc tgg ccc tea acc ate agt ccc tgc ttt aag aac ttc 804 
Arg Trp Pro Ser Thr He Ser Pro Cys Phe Lys Asn Phe 
220 225 230 

ctg cag gga ctg etc acc aaa gac cca egg cag cga ctg 84 3 
Leu Gin Gly Leu Leu Thr Lys Asp Pro Arg Gin Arg Leu 
15 235 240 

tec tgg cca gac etc tta tat cac ccc ttt att get ggt 882 
Ser Trp Pro Asp Leu Leu Tyr His Pro Phe He Ala Gly 
245 250 255 

cat gtc acc ata ata act gag cca gca ggc cca gat ttg 921 
20 His Val Thr lie He Thr Glu Pro Ala Gly Pro Asp Leu 
260 265 

ggg acc cca ttc acc age cgc eta ccc cca gaa ctt cag 960 
Gly Thr Pro Phe Thr Ser Arg Leu Pro Pro Glu Leu Gin 
270 275 280 

25 gtc eta aag gac gaa cag gec cat egg ttg gec ccc aag 999 
Val Leu Lys Asp Glu Gin Ala His Arg Leu Ala Pro Lys 
285 290 295 

ggt aat cag tct cgc ate ttg act cag gec tat aaa cgc 1038 
Gly Asn Gin Ser Arg He Leu Thr Gin Ala Tyr Lys Arg 
30 300 305 

atg get gag gag gec atg cag aag aaa cat cag aac aca 1077 
Met Ala Glu Glu Ala Met Gin Lys Lys His Gin Asn Thr 
310 315 320 

gga cct gec ctt gag caa gag gac aag acc age aag gtg 1116 
35 Gly Pro Ala Leu Glu Gin Glu Asp Lys Thr Ser Lys Val 
325 330 

get cct ggc aca gec cct ctg ccc aga etc ggg gec act 1155 
Ala Pro Gly Thr Ala Pro Leu Pro Arg Leu Gly Ala Thr 
335 340 345 

40 cct cag gaa tea age etc ctg gec ggg ate tta gec tea 1194 
Pro Gin Glu Ser Ser Leu Leu Ala Gly He Leu Ala Ser 
350 355 360 

gaa ttg aag age age tgg get aaa tea ggg act gga gag 1233 
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Glu Leu Lys Ser Ser Trp Ala Lys Ser Gly Thr Gly Glu 
365 370 

gtg ccc tct gca cct egg gaa aac egg acc acc cca gat 1272 
Val Pro Ser Ala Pro Arg Glu Asn Arg Thr Thr Pro Asp 
375 380 3 8 5 

tgt gaa cga gca ttc cca gag gag agg cca gag gtg ctg 1311 
Cys Glu Arg Ala Phe Pro Glu Glu Arg Pro Glu Val Leu 
390 395 

ggc cag egg age act gat gta gtg gac ctg gaa aat gag 1350 
Gly Gin Arg Ser Thr Asp Val Val Asp Leu Glu Asn Glu 
400 405 410 

gag cca gac agt gac aat gag tgg cag cac ctg eta gag 1389 
Glu Pro Asp Ser Asp Asn Glu Trp Gin His Leu Leu Glu 
415 420 425 

acc act gag cct gtg cct att caa ctg aag get cct etc 1428 
Thr Thr Glu Pro Val Pro He Gin Leu Lys Ala Pro Leu 
430 435 

acc ttg ctg tgt aat cct gac ttc tgc cag cgc ate cag 1467 
Thr Leu Leu Cys Asn Pro Asp Phe Cys Gin Arg He Gin 
20 440 445 450 

agt cag ctg cat gaa get gga ggg cag ate ctg aaa ggc 1506 
Ser Gin Leu His Glu Ala Gly Gly Gin He Leu Lys Gly 
455 460 



15 



25 



30 



ate ttg gag ggt get tec cac ate ctg cct gca ttc egg 1545 
He Leu Glu Gly Ala Ser His He Leu Pro Ala Phe Arg 
465 470 475 

gtc ctg age agt ctt etc tec age tgc agt gat tct gtt 1584 
Val Leu Ser Ser Leu Leu Ser Ser Cys Ser Asp Ser Val 
480 485 490 

gec ttg tat tec ttc tgc egg gag gca ggg ctt cct ggg 1623 
Ala Leu Tyr Ser Phe Cys Arg Glu Ala Gly Leu Pro Gly 
495 500 

ctg ctg ctg agt eta etc agg cac agt cag gag age aac 1662 
Leu Leu Leu Ser Leu Leu Arg His Ser Gin Glu Ser Asn 
35 505 510 515 

age etc cag cag caa tct tgg tat ggg acc ttc tta cag 1701 
Ser Leu Gin Gin Gin Ser Trp Tyr Gly Thr Phe Leu Gin 
520 525 



40 



gac ctg atg get gtg att cag gee tac ttt gee tgt acc 1740 
Asp Leu Met Ala Val lie Gin Ala Tyr Phe Ala Cys Thr 
530 535 540 

ttc aat ctg gag agg age cag aca agt gac age ctg cag 1779 
Phe Asn Leu Glu Arg Ser Gin Thr Ser Asp Ser Leu Gin 
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545 



550 555 



gtg ttt cag gag get gec aac ctt ttt ctg gac ctg ttg 1818 
Val Phe Gin Glu Ala Ala Asn Leu Phe Leu Asp Leu Leu 
560 565 

5 ggg aaa ctg ctg gec caa cca gat gac tct gag cag act 1857 
Gly Lys Leu Leu Ala Gin Pro Asp Asp Ser Glu Gin Thr 
570 575 580 

ttg egg agg gac age ctt atg tgc ttt act gtc ctg tgc 1896 
Leu Arg Arg Asp Ser Leu Met Cys Phe Thr Val Leu Cys 
10 585 590 

gaa gec atg gat ggg aac age egg gec ate tec aaa gec 1935 
Glu Ala Met Asp Gly Asn Ser Arg Ala lie Ser Lys Ala 
595 600 605 

ttt tac tec age ttg ctg acg aca cag cag gtt gtc ttg 1974 
15 Phe Tyr Ser Ser Leu Leu Thr Thr Gin Gin Val Val Leu 

615 620 



20 



30 



610 



gat ggg etc ctt cat ggc ttg aca gtt cca cag etc cct 2013 
Asp Gly Leu Leu His Gly Leu Thr Val Pro Gin Leu Pro 
625 630 

gtc cac act ccc caa ggt tec eta etc ctg ctg cca tgt 2052 
Val His Thr Pro Gin Gly Ser Leu Leu Leu Leu Pro Cys 
635 640 645 

egg tga g t actggtgcta ttgtctaggg caagagcctc 20 90 
Arg Xaa 



25 648 



aggeetttgg agt tac tet ttg ctt tte tee aca gga gec 2130 
Ser Tyr Ser Leu Leu Phe Ser Thr Gly Ala 
5 10 



1 



ccg caa gtg age cag cca ctg cga gag cag agt gag gat 2169 
Pro Gin Val Ser Gin Pro Leu Arg Glu Gin Ser Glu Asp 



15 



20 



ata cct gga gec att tec tct gee ctg gca gee ata tgc 2208 
He Pro Gly Ala He Ser Ser Ala Leu Ala Ala He Cys 

30 35 



25 



35 act get cct gtg gga ctg ccc gac tgc tgg gat gec aag 2247 
Thr Ala Pro Val Gly Leu Pro Asp Cys Trp Asp Ala Lys 

45 



40 



gag cag gtc tgt tgg cat ttg gca aat cag eta act gaa 2286 
Glu Gin Val Cys Trp His Leu Ala Asn Gin Leu Thr Glu 
40 50 55 60 

gac age age cag etc agg cca tec etc ate tct ggc ctg 2325 
Asp Ser Ser Gin Leu Arg Pro Ser Leu He Ser Gly Leu 
65 70 75 
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cag cat ccc ate ctg tgc ctg cac ctt etc aag gtt eta 2364 
Gin His Pro lie Leu Cys Leu His Leu Leu Lys Val Leu 



10 



20 



25 



80 



85 



tac tec tgc tgc ctt gtc agt gag ggc ctg tgc cgt ctt 2403 
Tyr Ser Cys Cys Leu Val Ser Glu Gly Leu Cys Arq Leu 

ctg ggg cag gag ccc ctg gee ttg gaa tec ctg ttt atg 2442 
Leu Gly Gin Glu Pro Leu Ala Leu Glu Ser Leu Phe Met 
105 110 

ttg att cag ggc aag gta aaa gta gta gat tgg gaa gag 24 81 
Leu He Gin Gly Lys Val Lys Val Val Asp Trp Glu Glu 
115 120 12S 

tct act gaa gtg aca etc tac ttc etc tec ctt ctt gtc 2520 
Ser Thr Glu Val Thr Leu Tyr Phe Leu Ser Leu Leu Val 
15 130 135 140 

ttt egg etc eaa aac ctg cct tgt gga atg gag aag eta 2559 
Phe Arg Leu Gin Asn Leu Pro Cys Gly Met Glu Lys Leu 
145 150 



ggc agt gac gtt get act etc ttt ace cat teg cat gtc 2598 
Gly Ser Asp Val Ala Thr Leu Phe Thr His Ser His Val 
155 ISO 165 

gtc tct ctt gtg agt gca gca gee tgt eta ttg gga cag 2637 
Val Ser Leu Val Ser Ala Ala Ala Cys Leu Leu Gly Gin 
170 175 

ctt ggt cag eaa ggg gtg acc ttt gac etc cag ccc atg 2676 
Leu Gly Gin Gin Gly Val Thr Phe Asp Leu Gin Pro Met 
180 185 190 

gaa tgg atg get gca gec aca cat gec ttg tct gec cct 2715 
Glu Trp Met Ala Ala Ala Thr His Ala Leu Ser Ai a Pro 
30 195 200 205 

gca gag etc etc act gag gta cag atg gat ctt ggg atg 2754 
Ala Glu Leu Leu Thr Glu Val Gin Met Asp Leu Gly Met 
210 215 

gat ggg aag taaagag agaggaactg ggcattttgg ggagcctctg 2800 
35 Asp Gly Lys 

220 221 

gaccagagga atgaagaagc aacccacagc cttccctctc aagctactgt 2850 
gcctgtgata gecttggaac ttccccgcct gccctcagta ctgacccttt 2900 
gaaggaaacc attcgetgeg tcccetggga tecagtggga gataaaatga 2950 
40 attccctggg tttcagcaga catacacatg agttgtgagg tcagagggtt 3000 
aaggtttgat aagaaaatga aataagacga cagggaaata ctaggtggga 3050 
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aagcggaagg aattatttct gggacttcct ttacttgtaa gtcagggaca 3100 
ggaatgaata aaagcatttg gattcctgac ttctgtcttt ccccccgccc 3150 
tctttcactt ttatctctag caggggaagg ctagcctaat cagggatatg 3200 
tccagttcag aaatgtggac cgttttgtgg caccgcttct ccatggtcct 32 50 
5 gaggctcccc gaggaggcat ctgcacagga aggggagctt tcgctatcca 33 00 
gtccaccaag ccctgagcca gactggacac tgatttctcc ccagggcatg 33 50 
gcagccctgc tgagcctggc catggccacc tttacccagg agccccagtt 34 00 
atgcctgagc tgcctgtccc agcatggaag tatcctcatg tccatcctga 34 50 
agcatctgct ttgccccagc ttcctgaatc aactgcgcca ggcgcctcat 3500 
10 gggtctgagt ttctccctgt cgtggtgctc tctgtctgcc agctcctttg 3550 
cttccccttt gcgctggaca tggatgctga cctccttata ggtgtcttgg 3600 
ccgacctcag ggactcagaa gttgcagccc atctgctgca ggtctgctgc 3650 
taccatcttc cgttgatgca agtggagctg cccatcagcc ttctcacacg 3700 
cctggccctc atggatccca cctctctcaa ccagtttgtg aacacagtgt 3750 
15 ctgcctcccc tagaaccatc gtctcgtttc tctcagttgc cctcctgagt 3800 
gaccagccac tgttgacctc cgaccttctc tctctgctgg cccatactgc 3850 
cagggtcctg tctcccagcc acttgtcctt tatccaagag cttctggctg 3900 
gctctgatga atcctatcgg cccctgcgca gcctcctggg ccacccagag 3 950 
aattctgtgc gggcacacac ttataggctc ctgggacact tgctccaaca 4000 
20 cagcatggcc ctgcgtgggg cactgcagag ccagtctgga ctgctcagcc 4050 
ttctgctgct tgggcttgga gacaaggatc ctgttgtgcg gtgcagtgcc 4100 
agctttgctg tgggcaatgc agcctaccag gctggtcctc tgggacctgc 4150 
cctggcagct gcagtgccca gcatgaccca gctgcttgga gatcctcagg 4200 
ctggtatccg gcgcaatgtt gcatcagctc tgggcaactt gggacctgaa 4250 
25 ggtttgggag aggagctgtt acagtgcgaa gtaccccagc ggctcctaga 4300 
aatggcatgt ggagaccccc agccaaatgt gaaggaggct gccctcattg 4350 
ccctccggag cctgcaacag gagcctggca tccatcaggt actggtgtcc 4400 
ctgggtgcca gtgagaaact atccttgctc tctctgggga atcagtcact 44 50 
gccacacagc agtcctaggc ctgcctctgc caaacactgc aggaaactca 4500 
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ttcacctcct gaggccagcc catagcatgt gattccagat tcctgcggtc 4550 

cagcctccaa ctttggttgc cagctctttc ttattctact acacaagccg 4600 

ccaactcaac tgagagctaa agagactaga aaagagataa gctgccaact 4650 

caactgagaa caagaaacta gaagagattt atatataaag cttcttcctt 4700 

ctcccagatg caggatgttt tcaaccagta aattttattg ctgttggtgc 4750 

cagagaagag tcctttcttc tctacatcca ggggcctttt ctccaataat 4800 

gtgcctttaa ctctagggac ctgcctcacg gaccttaggg aaaaacctca 4850 

acctgaaaga tctcttcctt tctggagctc ctttaatctt cccagcaggt 4900 

ttttgcctta gacgtgctgg ccccaggaca gtgatgaaga cagagcctgt 4 95 0 

ctcagctcta ggctgtgggg atcaatgcca tcagtccctg ttattgaggg 5000 

attatccctt agccaacatt cctatctgtg ggtgggcgtg gagagtgtat 5050 

ctttttttgg ggtgtgtgtg tatatgtgtg tgtgtatgtg tgtgtgtgtt 5100 

taatagttct gtttgtaaac tcttttaata aaagttgtgc ctcaccatac 5150 

ttgaagctcc caggacaagg gttgagaggc tcaacccctc tttcagcttc 5200 

tatgtggtgt tggaggtgct ggtatcgtgt tcacacaaaa aaaaaaaaaa 5250 

aa 5252 

<210> 7 

<211> '"47 

<212> PRT 

<213> Homo sapiens 

<400> 7 

Met Glu Lys Tyr His Val Leu Glu Met lie Gly Glu Gly Ser Phe 
1 5 10 15 

Gly Arg Val Tyr Lys Gly Arg Arg Lys Tyr Ser Ala Gin Val Val 
20 25 30 

Ala Leu Lys Phe lie Pro Lys Leu Gly Arg Ser Glu Lys Glu Leu 
35 40 45 

Arg Asn Leu Gin Arg Glu lie Glu lie Met Arg Gly Leu Arg His 
50 55 60 

Pro Asn lie Val His Met Leu Asp Ser Phe Glu Thr Asp Lys Glu 
65 70 75 

Val Val Val Val Thr Asp Tyr Ala Glu Gly Glu Leu Phe Gin lie 
80 85 90 

Leu Glu Asp Asp Gly Lys Leu Pro Glu Asp Gin Val Gin Ala lie 
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95 100 105 

Ala Ala Gin Leu Val Ser Ala Leu Tyr Tyr Leu His Ser His Arg 
110 H5 120 

He Leu His Arg Asp Met Lys Pro Gin Asn He Leu Leu Ala Lys 
5 125 130 135 

Gly Gly Gly He Lys Leu Cys Asp Phe Gly Phe Ala Arg Ala Met 
140 145 150 

Ser Thr Asn Thr Met Val Leu Thr Ser He Lys Gly Thr Pro Leu 
155 160 165 

10 Tyr Met Ser Pro Glu Leu Val Glu Glu Arg Pro Tyr Asp His Thr 

170 175 180 

Ala Asp Leu Trp Ser Val Gly Cys He Leu Tyr Glu Leu Ala Val 
185 190 195 

Gly Thr Pro Pro Phe Tyr Ala Thr Ser He Phe Gin Leu Val Ser 
15 200 205 210 

Leu He Leu Lys Asp Pro Val Arg Trp Pro Ser Thr lie Ser Pro 
215 220 225 

Cys Phe Lys Asn Phe Leu Gin Gly Leu Leu Thr Lys Asp Pro Arg 
230 235 240 

20 Gin Arg Leu Ser Trp Pro Asp Leu Leu Tyr His Pro Phe He Ala 

245 250 255 

Gly His Val Thr He He Thr Glu Pro Ala Gly Pro Asp Leu Gly 
260 265 ??0 

Thr Pro Phe Thr Ser Arg Leu Pro Pro Glu Leu Gin Val Leu Lys 
25 275 280 285 

Asp Glu Gin Ala His Arg Leu Ala Pro Lys Gly Asn Gin Ser Arg 
290 295 300 

He Leu Thr Gin Ala Tyr Lys Arg Met Ala Glu Glu Ala Met Gin 
305 310 315 

30 Lys Lys His Gin Asn Thr Gly Pro Ala Leu Glu Gin Glu Asp Lys 

320 325 330 

Thr Ser Lys Val Ala Pro Gly Thr Ala Pro Leu Pro Arg Leu Gly 
335 340 345 

Ala Thr Pro Gin Glu Ser Ser Leu Leu Ala Gly He Leu Ala Ser 
35 350 355 360 

Glu Leu Lys- Ser Ser Trp Ala- Lys Ser Gly Thr Gly Glu Val Pro 
365 370 375 

Ser Ala Pro Arg Glu Asn Arg Thr Thr Pro Asp Cys Glu Arg Ala 
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380 3 8 5 390 

Phe Pro Glu Glu Arg Pro Glu Val Leu Gly Gin Arg Ser Thr Asp 
395 400 405 

Val Val Asp Leu Glu Asn Glu Glu Pro Asp Ser Asp Asn Glu Trp 
410 415 420 

Gin His Leu Leu Glu Thr Thr Glu Pro Val Pro He Gin Leu Lys 
425 430 435 

Ala Pro Leu Thr Leu Leu Cys Asn Pro Asp Phe Cys Gin Arg He 
440 445 450 

Gin Ser Gin Leu His Glu Ala Gly Gly Gin He Leu Lys Gly He 
455 460 465 

Leu Glu Gly Ala Ser His He Leu Pro Ala Phe Arg Val Leu Ser 
470 475 480 

Ser Leu Leu Ser Ser Cys Ser Asp Ser Val Ala Leu Tyr Ser Phe 
15 485 490 495 

Cys Arg Glu Ala Gly Leu Pro Gly Leu Leu Leu Ser Leu Leu Arg 
500 505 510 

His Ser Gin Glu Ser Asn Ser Leu Gin Gin Gin Ser Trp Tyr Gly 
515 520 525 

20 Thr Phe Leu Gin Asp Leu Met Ala Val He Gin Ala Tyr Phe Ala 

530 535 540 

Cys Thr Phe Asn Leu Glu Arg Ser Gin Thr Ser Asp Ser Leu Gin 
545 550 555 

Val Phe Gin Glu Ala Ala Asn Leu Phe Leu Asp Leu Leu Gly Lys 
25 560 565 570 

Leu Leu Ala Gin Pro Asp Asp Ser Glu Gin Thr Leu Arg Arg Asp 
575 580 585 

Ser Leu Met Cys Phe Thr Val Leu Cys Glu Ala Met Asp Gly Asn 
590 595 600 

30 Ser Arg Ala He Ser Lys Ala Phe Tyr Ser Ser Leu Leu Thr Thr 

605 610 615 

Gin Gin Val Val Leu Asp Gly Leu Leu His Gly Leu Thr Val Pro 
620 625 630 

Gin Leu Pro Val His Thr Pro Gin Gly Ser Leu Leu Leu Leu Pro 
35 635 640 645 

Cys Arg 
647 

<210> 8 
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<211> 32 
<212> DNA 

<213> artificial sequence 
<220> 

5 <223> artificial sequence 
<400> 8 

caatacaatg gtgctgacat ccatcaaagg ca 32 

<210> 9 
<211> 23 
10 <212> DNA 

<213> artificial sequence 

<220> 

<223> artificial sequence 
<400> 9 

15 gaagggaggg gtgcctactg cca 23 

<210> 10 

<211> 38 

<212> DNA 

<213> artificial sequence 

20 <220> 

<223> artificial sequence 

<400> 10 

ctccagctct ggagacatat agagtggtgt gcctttga 38 

<210> 11 
25 <211> 60 
<212> DNA 

<213> artificial sequence 
<220> 

<223> artificial sequence 

30 <400> 11 _„ 
ccatcgatgt acccatacga cgtcccagac tacgctgaaa agtaccacgt 50 

gttggagatg 60 

<210> 12 
<211> 32 
35 <212> DNA 

<213> artificial sequence 

<220> 

<223> artificial sequence 
<400> 12 

40 gctctagact aaggggcagg tcctgtgttc tg 32 
<210> 13 
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<211> 21 
<212> DNA 

<213> artificial sequence 
<220> 

5 <223> artificial sequence 

<400> 13 
ctgacgacac agcaggttgt c 21 

<210> 14 
<211> 21 
10 <212> DNA 

<213> artificial sequence 

<220> 

<223> artificial sequence 
<400> 14 

15 cagatgcttc aggatggaca t 21 

<210> 15 

<211> 19 

<212> DNA 

<213> artificial sequence 

20 <220> 

<223> artificial sequence 

<400> 15 
agagtagcaa cgtcactgc 19 

<210> 16 
25 <211> 21 
<212> DNA 

<213> artificial sequence 
<220> 

<223> artificial sequence 

30 <400> 16 

cctcactgac aaggcagcag g 21 

<210> 17 
<211> 21 
<212> DNA 
35 <213> artificial sequence 

<220> 

<223> artificial sequence 

<400> 17 
cccgaggagg catctgcaca g 21 

40 <210> IS 
<211> 21 
<212> DNA 
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<213> artificial sequence 

<220> 

<223> artificial sequence 

<400> 18 
5 cagaacttca ggtcctaaag g 21 

<210> 19 
<211> 32 
<212> DNA 

<213> artificial sequence 



10 <220> 

<223> artificial sequence 



<400> 19 

tcgacaagca gggaacaccc aagtagaagc 

<210> 20 
15 <211> 32 
<212> DNA 

<213> artificial sequence 



<220> 

<223> artificial sequence 



20 <400> 20 

tcgacaagca gggaagtggg aagtagaagc tc 32 

<210> 21 
<211> 685 
<212> PRT 
25 <213> Homo sapiens 



< Ser > Tyi Ser Leu Leu Phe Ser Thr Gly Ala Pro Gin Val Ser Gin 
1 5 10 15 

Pro Leu Arg Glu Gin Ser Glu Asp He Pro Gly Ala He Ser Ser 



30 



40 



20 



25 



30 



Ala Leu Ala Ala He Cys Thr Ala Pro Val Gly Leu Pro Asp Cys 



35 



40 



45 



Trp Asp Ala Lys Glu Gin Val Cys Trp His Leu Ala Asn Gin Leu 
50 55 60 

35 Thr Glu Asp Ser Ser Gin Leu Arg Pro Ser Leu He Ser Gly Leu 

65 70 75 

Gin His Pro He Leu Cys Leu His Leu Leu Lys Val Leu Tyr Ser 



80 



85 90 



Cvs Cys Leu Val Ser Glu Gly Leu Cys Arg Leu Leu Gly Gin Glu 
y " 7 95 100 105 
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Pro Leu Ala Leu Glu Ser Leu Phe Met Leu lie Gin Gly Lys Val 
110 115 120 

Lys Val Val Asp Trp Glu Glu Ser Thr Glu Val Thr Leu Tyr Phe 
125 130 135 

5 Leu Ser Leu Leu Val Phe Arg Leu Gin Asn Leu Pro Cys Gly Met 

140 145 150 

Glu Lys Leu Gly Ser Asp Val Ala Thr Leu Phe Thr His Ser His 
155 160 165 

Val Val Ser Leu Val Ser Ala Ala Ala Cys Leu Leu Gly Gin Leu 
10 170 175 180 

Gly Gin Gin Gly Val Thr Phe Asp Leu Gin Pro Met Glu Trp Met 
185 190 195 

Ala Ala Ala Thr His Ala Leu Ser Ala Pro Ala Glu Val Arg Leu 
200 205 210 

15 Thr Pro Pro Gly Ser Cys Gly Phe Tyr Asp Gly Leu Leu lie Leu 

215 220 225 

Leu Leu Gin Leu Leu Thr Glu Gin Gly Lys Ala Ser Leu lie Arg 
230 235 240 

Asp Met Ser Ser Ser Glu Met Trp Thr Val Leu Trp His Arg Phe 
20 245 250 255 

Ser Met Val Leu Arg Leu Pro Glu Glu Ala Ser Ala Gin Glu Gly 
260 265 270 

Glu Leu Ser Leu Ser Ser Pro Pro Ser Pro Glu Pro Asp Trp Thr 
275 280 285 

25 Leu lie Ser Pro Gin Gly Met Ala Ala Leu Leu Ser Leu Ala Met 

290 295 300 

Ala Thr Phe Thr Gin Glu Pro Gin Leu Cys Leu Ser Cys Leu Ser 
305 310 315 

Gin His Gly Ser lie Leu Met Ser lie Leu Lys His Leu Leu Cys 
30 320 325 330 

Pro Ser Phe Leu Asn Gin Leu Arg Gin Ala Pro His Gly Ser Glu 
335 340 345 

Phe Leu Pro Val Val Val Leu Ser Val Cys Gin Leu Leu Cys Phe 
350 355 360 

35 Pro Phe Ala Leu Asp Met Asp Ala Asp Leu Leu He Val Val Leu 

365 370 375 

Ala Asp Leu Arg Asp Ser Glu Val Ala Ala His Leu Leu Gin Val 
380 385 390 
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Cys Cys Tyr His Leu Pro Leu Met Gin Val Glu Leu Pro He Ser 
395 400 405 

Leu Leu Thr Arg Leu Ala Leu Met Asp Pro Thr Ser Leu Asn Gin 
410 415 420 

5 Phe Val Asn Thr Val Ser Ala Ser Pro Arg Thr He Val Ser Phe 

425 430 435 

Leu Ser Val Ala Leu Leu Ser Asp Gin Pro Leu Leu Thr Ser Asp 
440 445 450 

Leu Leu Ser Leu Leu Ala His Thr Ala Arg Val Leu Ser Pro Ser 
10 455 460 465 

His Leu Ser Phe He Gin Glu Leu Leu Ala Gly Ser Asp Glu Ser 
470 475 480 

Tyr Arg Pro Leu Arg Ser Leu Leu Gly His Pro Glu Asn Ser Val 
485 490 495 

15 Arg Ala His Thr Tyr Arg Leu Leu Gly His Leu Leu Gin His Ser 

500 505 510 

Met Ala Leu Arg Gly Ala Leu Gin Ser Gin Ser Gly Leu Leu Ser 
515 520 525 

Leu Leu Leu Leu Gly Leu Gly Asp Lys Asp Pro Val Val Arg Cys 
20 530 535 540 

Ser Ala Ser Phe Ala Val Gly Asn Ala Ala Tyr Gin Ala Gly Pro 
545 550 555 

Leu Gly Pro Ala Leu Ala Ala Ala Val Pro Ser Met Thr Gin Leu 
560 565 570 

25 Leu Gly Asp Pro Gin Ala Gly He Arg Arg Asn Val Ala Ser Ala 

575 580 585 

Leu Gly Asn Leu Gly Pro Glu Gly Leu Gly Glu Glu Leu Leu Gin 
590 595 600 

Cys Glu Val Pro Gin Arg Leu Leu Glu Met Ala Cys Gly Asp Pro 
30 605 610 615 

Gin Pro Asn Val Lys Glu Ala Ala Leu He Ala Leu Arg Ser Leu 

620 625 630 

Gin Gin Glu Pro Gly He His Gin Val Leu Val Ser Leu Gly Ala 
635 640 645 

35 Ser Glu Lys Leu Ser Leu Leu Ser Leu Gly Asn Gin Ser Leu Pro 

650 655 660 

His Ser Ser Pro Arg Pro Ala Ser Ala Lys His Cys Arg Lys Leu 
665 670 675 
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He His Leu Leu Arg Pro Ala His Ser Met 
680 G85 

<210> 22 
<211> 221 
5 <212> PRT 

<213> Homo sapiens 

<400> 22 

Ser Tyr Ser Leu Leu Phe Ser Thr Gly Ala Pro Gin Val Ser Gin 
15 10 15 

10 Pro Leu Arg Glu Gin Ser Glu Asp He Pro Gly Ala He Ser Ser 

20 25 30 

Ala Leu Ala Ala He Cys Thr Ala Pro Val Gly Leu Pro Asp Cys 
35 40 45 

Trp Asp Ala Lys Glu Gin Val Cys Trp His Leu Ala Asn Gin Leu 
15 50 55 60 

Thr Glu Asp Ser Ser Gin Leu Arg Pro Ser Leu He Ser Gly Leu 
65 70 75 

Gin His Pro lie Leu Cys Leu His Leu Leu Lys Val Leu Tyr Ser 
30 85 90 

2 0 Cys Cys Leu Val Ser Glu Gly Leu Cys Arg Leu Leu Gly Gin Glu 

95 100 105 

Pro Leu Ala Leu Glu Ser Leu Phe Met Leu He Gin Gly Lys Val 
110 115 120 

Lys Val Val Asp Trp :lu Glu Ser Thr Glu Val Thr Leu Tyr Phe 
25 125 130 135 

Leu Ser Leu Leu Val Phe Arg Leu Gin Asn Leu Pro Cys Gly Met 
140 145 150 

Glu Lys Leu Gly Ser Asp Val Ala Thr Leu Phe Thr His Ser His 
155 160 165 

30 Val Val Ser Leu Val Ser Ala Ala Ala Cys Leu Leu Gly Gin Leu 

170 175 180 

Gly Gin Gin Gly Val Thr Phe Asp Leu Gin Pro Met Glu Trp Met 
185 190 195 

Ala Ala Ala Thr His Ala Leu Ser Ala Pro Ala Glu Leu Leu Thr 
35 200 205 210 

Glu Val Gin Met Asp Leu Gly Met Asp Gly Lys 
215 220 221 

<210> 23 
<211> 795 
4 0 c212> PRT 
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<213> Drosophila virilis 
<400> 23 

Met Asp Arg Tyr Ala Val Ser Ser Leu Val Gly Gin Gly Ser Phe 

15 10 15 

5 Gly Cys Val Tyr Lys Ala Gin Arg Arg Asp Asp Asp Lys Val Val 

20 25 30 

Ala lie Lys Val lie Ser Lys Arg Gly Arg Ser Asn Arg Glu Leu 

35 40 45 

Lys Asn Leu Arg Arg Glu Cys Asp He Gin Ala Arg Leu Lys His 

10 50 55 60 

Pro His Val He Glu Met Val Glu Ser Phe Glu Ser Lys Phe Asp 

65 70 75 

Leu Phe Val Val Thr Glu Phe Ala Leu Met Asp Leu His Arg Tyr 

80 85 90 

15 Leu Ser Phe Asn Gly Ala Met Pro Glu Glu His Ala Gin Arg Val 

95 100 105 

Val Cys His Leu Val Ser Ala Leu Tyr Tyr Leu His Ser Asn Arg 

110 115 120 

He Leu His Arg Asp Leu Lys Pro Gin Asn Val Leu Leu Asp Lys 

20 125 130 135 

Asn Met His Ala Lys Leu Cys Asp Phe Gly Leu Ala Arg Asn Met 

140 145 150 

Thr Met Gly Thr His Val Leu Thr Ser He Lys Gly Thr Pro Leu 

155 160 165 

25 Tyr Met Ala Pro Glu Leu Leu Ala Glu Gin Pro Tyr Asp His Gin 

170 175 180 

Ala Asp Met Trp Ser Leu Gly Cys He Ala Tyr Glu Ser Met Ala 

185 190 195 

Gly Gin Pro Pro Phe Cys Ala Thr Ser He Leu His Leu Val Lys 

30 200 205 210 

Leu He Lys His Glu Asp Val Lys Trp Pro Ser Thr Leu Ser Ser 

215 220 225 

Glu Cys Arg Ser Phe Leu Gin Gly Leu Leu Glu Lys Asp Pro Ser 

230 235 240 

35 Met Arg lie Ser Trp Thr Gin Leu Leu Cys His Pro Phe Val Glu 

245 250 255 

Gly Lys Leu Tyr He Ala Glu Val Gin Ala Ala Gin Thr Ser Pro 

260 265 270 
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Phe lie Asn Pro Gin Leu Ala Lys Asp Thr Lys Lys Ser Gin Gin 
275 280 285 

Leu Arg His Val Gly Ala Asp Leu Gly Asp Val Leu Ala Ala Leu 
290 295 300 

5 Lys Leu Ser Asp Val Ala Asn Glu Asn Leu Ser Thr Ser Arg Asp 

305 310 315 

Ser He Asn Ala He Ala Pro Ser Asp He Glu Gin Leu Glu Thr 
320 325 330 

Asp Val Glu Asp Asn Val His Arg Leu He Val Pro Phe Ala Asp 
10 335 340 345 

He Ser Tyr Arg Glu Leu Pro Cys Gly Thr Ala Ala Ala Ala Arg 
350 355 360 

Arg Ala Gly Ala Met Pro Leu He Asn Ser Gin Thr Cys Phe Val 
365 370 375 

15 Ser Gly Asn Ser Asn Met He Leu Asn His Leu Asn Asp Asn Phe 

380 385 390 

Ala He Glu Ala Pro Ala Ser Ser Ala Thr Lys Ser Met Lys Ser 
395 40C 405 

Lys Leu Lys Leu Ala Leu Asn He Lys Gin Ser Arg Ser Lys Asp 
20 410 415 420 

Leu Glu Lys Arg Lys Leu Ser Gin Asn Leu Asp Asn Phe Ser Leu 
425 430 435 

Arg Leu Gly Gin Ser He Asp He Glu Val Gin Arg Lys Thr Thr 
440 445 450 

25 Glu Met Leu Thr Gin Gin Ser Gin Ala Gin Gin Leu Gin Asp Arg 

455 460 465 

Lys Thr Gin Gin Leu Lys Gin Ser Met His Ser Thr Asn Asp Glu 
470 475 480 

Lys Leu Ser Ser Asp Asn Ser Pro Pro Cys Leu Leu Pro Gly Trp 
30 485 490 495 

Asp Ser Cys Asp Glu Ser Gin Ser Pro Pro He Glu Asn Asp Glu 
500 505 510 

Trp Leu Ala Phe Leu His Arg Ser He Gin Glu Leu Leu Asp Gly 
515 520 525 

35 Glu Phe Asp Ser Leu Lys Gin His Asn Leu Val Ser He He Val 

530 535 540 

Ala Pro Leu Arg Asn Ser Lys Ala He Pro Lys Val Leu Gin Ser 
545 550 555 
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Val Ala Gin Leu Leu Ser Leu Pro Phe Val Leu Ala Glu Gin His 
560 565 570 

Leu Val Ala Glu Ala lie Lys Gly Val Tyr He Asp Val Lys Leu 
575 580 585 

5 Val Pro Asn Leu Met Tyr Ala Cys Lys Leu Leu Leu Ser Gin Arg 

590 595 600 

His Leu Thr Asp Ser Ala Ala Ser Leu Pro Ala Gly Thr Gly Val 
605 610 615 

Ser Leu Ser Arg Thr Val Arg Ser Cys Ser Asp Leu Ser Ala Glu 
X0 620 625 630 

Glu Met Ser Thr Ala Cys Ser Leu Tyr Glu Leu Val Cys His Leu 
635 640 645 

Val His Gin Gin Gin Gin Phe Leu Thr Gin Phe Cys Asp Ala Val 
650 655 660 

15 Ala He Leu Ala Val Asn Asp Met Phe He Asn Phe Leu Thr His 

665 670 675 

Asp Phe Lys Asp Ser Arg Pro Val Arg Leu Ala Ser Cys Met Leu 
680 685 690 

Ala Leu Phe Cys Cys Val Leu Arg Glu Leu Pro Glu Asn Ala Glu 
20 695 700 705 

Leu Val Glu Lys He Val Phe Asp Ser Arg Leu Gin Leu Ala Val 
710 715 720 

Leu Leu Gin Ser Arg His His Leu Leu Arg Gin Arg Ala Cys Gin 
725 730 735 

2 5 Met Leu Leu Leu Leu Ala Arg Phe Ser Leu Arg Gly Val Gin Cys 

740 745 750 

He Trp Ser Gly Glu Leu Lys Ser Ala Leu Gin Ala Trp Pro Met 
755 760 765 

Gin Gin Thr Cys Gin Ser Leu Arg Leu Glu Ala Ala Gin Thr Leu 
30 770 775 730 

Asp Glu Leu Ser Gin Phe Ser Phe Phe Val Ala Gin Ala Thr Ala 
7S5 790 795 

<210> 24 
<211> 260 
35 <212> PRT 

<213> Homo sapiens 

<400> 24 

Met Glu Lys Tyr His Val Leu Glu Met He Gly Glu Gly Ser Phe 
1 5 10 15 
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Gly Arg Val Tyr Lys Gly Arg Arg Lys Tyr Ser Ala Gin Val Val 
20 25 30 

Ala Leu Lys Phe lie Pro Lys Leu Gly Arg Ser Glu Lys Glu Leu 
35 40 45 

5 Arg Asn Leu Gin Arg Glu He Glu He Met Arg Gly Leu Arg His 

50 55 60 

Pro Asn He Val His Met Leu Asp Ser Phe Glu Thr Asp Lys Glu 
65 70 75 

Val Val Val Val Thr Asp Tyr Ala Glu Gly Glu Leu Phe Gin He 
10 80 85 90 

Leu Glu Asp Asp Gly Lys Leu Pro Glu Asp Gin Val Gin Ala He 
95 100 105 

Ala Ala Gin Leu Val Ser Ala Leu Tyr Tyr Leu His Ser His Arg 
110 115 120 

15 He Leu His Arg Asp Met Lys Pro Gin Asn He Leu Leu Ala Lys 

125 130 135 

Gly Gly Gly He Lys Leu Cys Asp Phe Gly Phe Ala Arg Ala Met 
140 145 150 

Ser Thr Asn Thr Met Val Leu Thr Ser He Lys Gly Thr Pro Leu 
20 155 160 165 

Tyr Met Ser Pro Glu Leu Val Glu Glu Arg Pro Tyr Asp His Thr 
170 175 180 

Ala Asp Leu Trp Ser Val Gly Cys He Leu Tyr Glu Leu Ala _ .'al 
185 190 195 

25 Gly Thr Pro Pro Phe Tyr Ala Thr Ser He Phe Gin Leu Val Ser 

200 205 210 

Leu He Leu Lys Asp Pro Val Arg Trp Pro Ser Thr He Ser Pro 
215 220 225 

Cys Phe Lys Asn Phe Leu Gin Gly Leu Leu Thr Lys Asp Pro Arg 
30 230 235 240 

Gin Arg Leu Ser Trp Pro Asp Leu Leu Tyr His Pro Phe He Ala 
245 250 255 

Gly His Val Thr He 
260 

35 <210> 25 

<211> 1315 

<212> PRT ... 

<213> artificial sequence 

<220> 
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<223> artificial sequence 

< Met > Glu Lys Tyr His Val Leu Glu Met lie Gly Glu Gly Ser Phe 
X 5 10 15 

Gly Arg Val Tyr Lys Gly Arg Arg Lys Tyr Ser Ala Gin Val Val 
20 25 30 

Ala Leu Arg Phe lie Pro Lys Leu Gly Arg Ser Glu Lys Glu Leu 
35 40 

Arg Asn Leu Gin Arg Glu He Glu He Met Arg Gly Leu Arg His 
50 55 60 

Pro Asn lie Val His Met Leu Asp Ser Phe Glu Thr Asp Lys Glu 
65 70 75 

Val val Val Val Thr Asp Tyr Ala Glu Gly Glu Leu Phe Gin He 
80 85 90 

Leu Glu Asp Asp Gly Lys Leu Pro Glu Asp Gin Val Gin Ala He 
95 100 1° 5 



Ala Ala Gin Leu Val Ser Ala Leu Tyr Tyr Leu His Ser His Arg 
110 



115 120 



20 



He Leu His Arg Asp Met Lys Pro Gin Asn He Leu Leu Ala Lys 

125 130 I 35 

Gly Gly Gly He Lys Leu Cys Asp Phe Gly Phe Ala Arg Ala Met 

- , r. 150 



140 



Ser Thr Asn Thr Met Val Leu Thr Ser He Lys Gly Thr ?ro Leu 
155 160 165 

2 5 Tyr Met Ser Pro Glu Leu Val Glu Glu Arg Pro Tyr Asp His Thr 

170 1^5 I 80 

Ala Asp Leu Trp Ser Val Gly Cys He Leu Tyr Glu Leu Ala Val 
185 190 195 

Gly Thr Pro Pro Phe Tyr Ala Thr Ser He Phe Gin Leu Val Ser 
30 200 205 210 

Leu He Leu Lys Asp Pro Val Arg Trp Pro Ser Thr He Ser Pro 
215 220 225 

Cys Phe Lys Asn Phe Leu Gin Gly Leu Leu Thr Lys Asp Pro Arg 
230 235 240 

3 5 Gin Arg Leu Ser Trp Pro Asp Leu Leu Tyr His Pro Phe He Ala 

245 250 255 

Glv "is Val Thr He He Thr Glu Pro Ala Gly Pro Asp Leu Gly 
260 265 270 
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Thr Pro Phe Thr Ser Arg Leu Pro Pro Glu Leu Gin Val Leu Lys 
275 280 285 

Asp Glu Gin Ala His Arg Leu Ala Pro Lys Gly Asn Gin Ser Arg 
290 295 300 

5 lie Leu Thr Gin Ala Tyr Lys Arg Met Ala Glu Glu Ala Met Gin 

305 310 315 

Lys Lys His Gin Asn Thr Gly Pro Ala Leu Glu Gin Glu Asp Lys 
320 325 330 

Thr Ser Lys Val Ala Pro Gly Thr Ala Pro Leu Pro Arg Leu Gly 
10 335 340 345 

Ala Thr Pro Gin Glu Ser Ser Leu Leu Ala Gly lie Leu Ala Ser 
350 355 360 

Glu Leu Lys Ser Ser Trp Ala Lys Ser Gly Thr Gly Glu Val Pro 
365 370 375 

15 Ser Ala Pro Arg Glu Asn Arg Thr Thr Pro Asp Cys Glu Arg Ala 

380 385 390 

Phe Pro Glu Glu Arg Pro Glu Val Leu Gly Gin Arg Ser Thr Asp 
395 400 405 

Val Val Asp Leu Glu Asn Glu Glu Pro Asp Ser Asp Asn Glu Trp 
20 410 415 42Q 

Gin His Leu Leu Glu Thr Thr Glu Pro Val Pro lie Gin Leu Lys 
425 430 435 

Ala Pro Leu Thr Leu Leu Cys Asn Pro Asp Phe Cys Cln Arg He 
440 445 450 

25 Gin Ser Gin Leu His Glu Ala Gly Gly Gin He Leu Lys Gly He 

455 460 465 

Leu Glu Gly Ala Ser His He Leu Pro Ala Phe Arg Val Leu Ser 
470 475 480 

Ser Leu Leu Ser Ser Cys Ser Asp Ser Val Ala Leu Tyr Ser Phe 
30 485 490 495 

Cys Arg Glu Ala Gly Leu Pro Gly Leu Leu Leu Ser Leu Leu Arg 
500 505 510 

His Ser Gin Glu Ser Asn Ser Leu Gin Gin Gin Ser Trp Tyr Gly 
515 520 525 

35 Thr Phe Leu Gin Asp Leu Met Ala Val He Gin Ala Tyr Phe Ala 

530 535 540 

Cys Thr Phe Asn Leu Glu Arg Ser Gin Thr Ser Asp Ser Leu Gin 
545 550 555 
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Val Phe Gin Glu Ala Ala Asn Leu Phe Leu Asp Leu Leu Gly Lys 
560 565 "0 

Leu Leu Ala Gin Pro Asp Asp Ser Glu Gin Thr Leu Arg Arg Asp 
575 580 585 



5 Ser 



Leu Met Cys Phe Thr Val Leu Cys Glu Ala Met Asp Gly Asn 



590 595 



600 



10 



Ser Arg Ala He Ser Lys Ala Phe Tyr Ser Ser Leu Leu Thr Thr 
605 610 615 

Gin Gin Val Val Leu Asp Gly Leu Leu His Gly Leu Thr Val Pro 
620 625 630 

Gin Leu Pro Val His Thr Pro Gin Gly Ala Pro Gin Val Ser Gin 
635 640 645 

Pro Leu Arg Glu Gin Ser Glu Asp lie Pro Gly Ala lie Ser Ser 



650 



655 



660 



15 Ala Leu Ala Ala lie Cys Thr Ala Pro Val Gly Leu Pro Asp Cys 



665 



670 



675 



Tr P Asp Ala Lys Glu Gin Val Cys Trp His Leu Ala Asn Gin Leu 
680 685 690 



Thr Glu Asp Ser Ser Gin Leu Arg Pro Ser Leu He Ser Gly Leu 
20 ' 695 700 70= 

Gin His Pro He Leu Cys Leu His Leu Leu Lys Val Leu Tyr Ser 
710 715 720 

Cys Cys Leu Val Ser Glu Gly Leu Cys Arg Leu teu Gly Gin Glu 
725 730 735 

25 Pro Leu Ala Leu Glu Ser Leu Phe Met Leu He Gin Gly Lys Val 

740 745 750 

Lys Val Val Asp Trp Glu Glu Ser Thr Glu Val Thr Leu Tyr Phe 

— • — 76 b 



755 



760 



30 



35 



Leu Ser Leu Leu Val Phe Arg Leu Gin Asn Leu Pro Cys Gly Met 
770 775 7 80 

Glu Lys Leu Gly Ser Asp Val Ala Thr Leu Phe Thr His Ser His 
785 790 795 

Val Val Ser Leu Val Ser Ala Ala Ala Cys Leu Leu Gly Gin Leu 
800 805 810 

Gly Gin Gin Gly Val Thr Phe Asp Leu Gin Pro Met Glu Trp Met 



815 



820 825 



Ala Ala Ala Thr His Ala Leu Ser Ala Pro Ala Glu Val Arg Leu 
830 835 8,0 
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Thr Pro Pro Gly Ser Cys Gly Phe Tyr Asp Gly Leu Leu lie Leu 
845 850 855 

Leu Leu Gin Leu Leu Thr Glu Gin Gly Lys Ala Ser Leu lie Arg 
860 865 8 70 

5 Asp Met Ser Ser Ser Glu Met Trp Thr Val Leu Trp His Arg Phe 

875 880 885 

Ser Met Val Leu Arg Leu Pro Glu Glu Ala Ser Ala Gin Glu Gly 



890 895 



900 



Glu Leu Ser Leu Ser Ser Pro Pro Ser Pro Glu Pro Asp Trp Thr 
10 905 910 915 

Leu lie Ser Pro Gin Gly Met Ala Ala Leu Leu Ser Leu Ala Met 
920 925 930 



Ala Thr Phe Thr Gin Glu Pro Gin Leu Cys Leu Ser Cys Leu Ser 
935 940 945 

15 Gin His Gly Ser He Leu Met Ser He Leu Lys His Leu Leu Cys 



950 955 



960 



Pro Ser Phe Leu Asn Gin Leu Arg Gin Ala Pro His Gly Ser Glu 
9g 5 970 ' 975 

Phe Leu Pro Val Val Val Leu Ser Val Cys Gin Leu Leu Cys Phe 



20 980 985 



990 



Pro Phe Ala Leu Asp Met Asp Ala Asp Leu Leu He Val Val Leu 
995 1000 100 5 

Ala Asp Leu Arg Asp Ser Glu Val Ala Ma His Leu Leu Gin Val 
1010 1015 1020 

25 Cys Cys Tyr His Leu Pro Leu Met Gin Val Glu Leu Pro lie Ser 

1025 1030 1Q35 

Leu Leu Thr Arg Leu Ala Leu Met Asp Pro Thr Ser Leu Asn Gin 
1040 1045 1050 

Phe Val Asn Thr Val Ser Ala Ser Pro Arg Thr He Val Ser Phe 
30 1055 1060 1065 

Leu Ser Val Ala Leu Leu Ser Asp Gin Pro Leu Leu Thr Ser As- 
1070 1075 108 o 

Leu Leu Ser Leu Leu Ala His Thr Ala Arg Val Leu Ser Pro Ser 
1085 1090 1Q95 

35 His Leu Ser Phe He Gin Glu Leu Leu Ala Gly Ser Asp Glu Ser 

1100 li05 1110 



Tyr Arg Pro Leu Arg Ser Leu Leu Gly His Pro" Glu Asn Ser Val 
H15 1120 1125 
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^ Al a His Thr Tyr Arg Leu Leu Gly Hi. Leu Leu Gin Hi.^r 
1130 XXJ3 

„« Ala leu x, =iy «• «. «» s « f " s " Gly " u " 

1145 xx:5 

, leu l*u leu leu Gly leu Gly Asp W s «P « -1 val 

1160 lxb:3 

ser Ala Mr Pne ,1a val Gly Asa Ala Ala Tyr da Ala «y» 
1175 1180 

Ma nla Val Pro Ser Met Thr Gin Leu 
Leu Gly Pro Ala Leu Ala Ala Ala Val Pro ^ 

10 1190 

*i = rlv Tie Arq Arg Asn Val Ala Ser Ala 
Leu Gly Asp Pro Gin Ala Gly He Arg a y ^ 

1205 xZXU 

Leu G ly Asn leu Gly Pro Glu Gly leu Gly G!u G!u leu l.uda 
1220 X ^ S 

„ Cys Gla val Pro Gla Ar g lea u« Glu Met Ala cys Gly «P «o 

1235 X ^ 4U 

Gln Pro Asn val lys Gla Ala Ala lea II. Ala lav Ar g s.^lea 
1250 

«» Gl„ Gla Pre Gly He His Gin val «u val ser lea GlyM. 
20 1265 

S .r Gla lys lea Ser lea lea Ser leu Gly Asn Gla ser leu^ro 
1280 1285 

His ser ser Pro Ar g Pro Ala .,r Ala lys His cys Ar g W »- 
1295 XJU 

25 lie His Leu Leu Arg Pro Ala His Ser Met 

1310 

<210> 26- 
<211> 4586 
<212> DNA 
30 <213> Drosophila virilis 

^cagagtct gggecategg ctagctctgt agatgtgtaa tagaggcatc 50 
ttcgcgcgca gcatcgattc gcgctccagt tggttgagat gccgaatgtt 100 
ggtgcgcgtc tcaaagatga cgccgacatg gtgeatatae agaaaaaaga 150 
3S aacccgagcc taatggccgc gtattatege tgaggeggee ggegatttte 200 
aacaaatget acttaccaat tagcgcgtgc gataaaatta cgtacaaatt 250 

300 



ggcgtcgcgc 



atatttctgg cgtttgtgtt gcgtgtatct agttagtggg 
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ctcgtctatt cattatttac ttttggggcg tctgtttgat caaattagca 350 
gtgtctccta tgatatgcct gcagctttta cccgtaaaca aaattatttg 4 00 
ccacagctga tttattcgtt gccatgtaga ttaatcagct gtttcgcaat 450 
ttttaaaacc aggtgacttt ttaaaattgt accagctgtg tgtatcgatg 500 
5 tgcaagcata ctcattacgc catatgctgg tatatttata tcgaatataa 550 
acggttttgg tattttaata atcttaaaga agaaatagtt atgtgctgtg 600 
tatatgttca tcaagaactg ttcaaaatgt gcgccatact gatgttaatt 650 
ttgttttgct ggtttttttt gggaaaataa attgacgtgt tgatgtctcc 700 
gaatatatcg atacaatagc tatcattcgg acaagatatc gatatgtgga 750 

10 gtgtgttcgg tattttgcct ttagtttttt gtttttaaat tgcagtcaca 800 
ctgcggctta ttgaatttaa ggcacttcaa agcgcatttt actgtagaaa 850 
gttgagttct atttgcggtg acaatggacc gctacgcggt tagctctctg 900 
gtaggacaag gctcatttgg ctgtgtgtac aaggcccagc ggcgcgatga 950 
tgacaaagtg gtggccatca aagtcatatc aaaggtgagc tcaattgcat 1000 

15 cccggcttag ctgaataaaa gagtattcta cgaattggcg tgttctttgt 1050 
ttgcagcgtg gtcgttccaa tcgcgagctt aagaacctgc gtcgcgaatg 1100 
tgacattcag gcgcgtctca agcatccgca tgttatagaa atggtggagt 1150 
cgttcgaatc caaattcgat ttgttcgtgg tcaccgagtt cgctctaatg 1200 
gacttgcatc gatatttgtc ctttaatggc gccatgcccg aggagcacgc 1250 

20 acagcgtgtt gtctgtcatt tggtgtcggc gctctattat ctgcactcga 1300 
atcgcatact gcatcgggat ctaaagccgc aaaatgtgct gttggacaaa 1350 
aacatgcacg ccaagctctg cgactttggg ctggcacgca acatgacgat 1400 
gggcacacat gtgttgactt ccataaaggg cacgccgctt tatatggcgc 1450 
cggagctgct ggctgagcag ccgtacgatc accaggcaga tatgtggtcg 1500 

25 ctgggatgca ttgcctatga gagtatggcg ggccagccgc cgttctgcgc 1550 
aacctctata ctgcatctgg tgaagctgat caagcacgag gacgtcaaat 1600 
ggccgagcac gctgagcagc gagtgccgtt cctttttgca gggcttgctc 1650 
gagaaggatc ctagcatgcg catctcatgg acgcagctgc tttgccatcc 1700 
ttttgtcgag ggcaagctat acatagccga ggtacaggca gcacaaactt 1750 
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cgccctttat aaatccccag ctggccaagg 

ttgaggtgcg tttataacgt gtactgtagc 
tttttatgta ggcatgtagg cgcagatttg 
aaagttgagc gatgtggcca atgaaaactt 
tcaatgccat tgcgccgagt gacattgagc 
gataatgtgc atcggcttat agtgccattt 
gttgccatgc ggcactgcag cagctgctcg 
tgattaattc gcaaacctgc tttgtaagtg 
aatcatctga acgacaattt tgcaatcgaa 
caagtccatg aagtcgaagc tgaagctggc 
gtagcaagga tttggaaaag cgtaagctga 
tcgctgcgcc tgggacagag cattgacata 
tgagatgctc acgcagcaat cgcaggcaca 
cacagcagct gaagcaatcg atgcattcca 
agcgagtgag taaatgcatc catatttaaa 
tttggtttat aatagcaatt cgccgccttg 
gctgcgatga atctcagagc ccgcccattg 
ttcttgcatc gctccataca ggagctgctg 
gaagcagcac aatctagtca gcataattgt 
aggccatacc caaggtgctg cagagcgtgg 
tttgtgctgg ccgaacagca tttggtagcg 
tattgatgtc aagctggtgc ccaacttaat 
tctcgcagcg ccaccttacc gattcggctg 
ggcgtctccc tgagtcgaac cgtacgcagc 
ggagatgagc accgcctgca gcctgtacga 
atcagcagca gcagttcctc acccagttct 
gccgtcaacg acatgttcat aaattttctt 
tggacacagt gtgagacgca agcttaacca 
aaggatagca ggccggtgcg actcgctagc 
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acaccaaaaa atcacagcaa 1800 
cagctccact tatcgttcaa 1850 
ggcgatgtct tggcagcgtt 1900 
gagcacatcg cgagatagta 1950 
agctggaaac cgatgttgag 2000 
gcagatattt cctacagaga 2050 
tcgagctggt gccatgccac 2100 
gcaactccaa tatgatactc 2150 
gcgcctgctt cgagcgcaac 2200 
tctcaatata aaacagtcgc 2250 
gtcaaaattt ggataacttt 2300 
gaagtgcagc gcaaaacaac 23 50 
acagctgcag gataggaaga 24 00 
ccaacgacga gaaattgagc 24 50 
agtgaagctc tctaaagcta 2500 
tctgttgccc ggttgggaca 2550 
agaatgacga gtggctggcg 2600 
gacggcgaat ttgattcgct 26 50 
ggcgccattg cgaaactcca 2700 
cgcagctgct gtcgctgccc 275 0 
gaggccataa aaggagttta 2800 
gtacgcctgc aagctgcttc 2850 
cttcactgcc agccggcacg 2900 
tgctccgacc tgagtgccga 295 0 
gctggtctgc catctggtcc 3000 
gtgacgctgt ggcaatactc 3050 
acacatggtg agcagctggc 310 0 
ttccttgctt tgcagatttt 3150 
tgcatgctgg cattgttctg 3200 
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ttgcgttttg cgtgaactac ccgagaacgc cgagctggtg gagaaaattg 3250 
tatttgactc gcgcctacag ctggccgtcc tgctgcagag ccgtcatcat 3300 
ttgttgcgtc agcgcgcctg tcaaatgctg ttgctattgg cacgctttag 3350 
cctgcgcggc gtacagtgca tctggagtgg ggagctgaag agtgcgctcc 34 00 
5 aggcgtggcc gatgcagcaa acgtgtcaat cattgcgact ggaagccgcc 34 50 
caaacgctgg atgagcttag ccagttcagc ttctttgttg ctcaggcaac 3 500 
tgcttagtct ttattaataa ttgtacttgt atttgtttaa taaatcttaa 3550 
tccttgtcta gccgaacaga ccttccaaat tgccttgaaa gtagtcgagc 3 6 00 
agctcgtcca gatagctgct aaagccatca aagcccaaaa ggtagctacc 36 50 
10 attacagtcc tgctcgtaca tctcgtttag tttcgaaata tccttatccg 3700 
acagccgcgc gctggcccag tgaggcatac ggatcctaat gataatagca 3 750 
tgcattatta ttatttttca caatgtgtta ttcgtttaat acttataaaa 3800 
ccttaaattg tatgcatgta tgtatctatc ttatacctaa ttaatgaatg 3850 
aaatttatta j.cttgtctat ggatgtatgt gcatgtatgt atgtatgtat 3 900 
gtatgcataa aaatgtatgt tcatttataa caaacgcaga caaagataac 3950 
gatctgctgc tctacttccc gaatctcata aattcaagta cgccccgcag 4000 
atttcacgag tacatcacaa gtgttttttt ttaacaagta atgttggtat 4 050 
gtatttatgt atatatgtat ttaagtatgt atgtatttat gtatgtatgt 4100 
atgtatttat gtatgtatgt atttatgtat gtatgtattt atgtatgtat 4150 
gtatttatgt atgtatttat ttatgtatat atgtatttaa gtatgtatgt 4200 
atgtatttat gcatttatgt atttatgtat gtatgtataa gagtatgtgt 4250 
gtgtgtagat acatgtatgt atgtatgtat gtatgcgtgt atatttattt 4300 
atagtaaaca taccaacttt acttcccgct gccttgcgaa tttaaaataa 4350 
cgtattttta aatgatgccc tactcctcga ttctcaaaca tttaagtaag 4400 
ctctacaggt ttttccgatt tgattgtttt gtaaagttgt gttttttttt 4450 
ctgctcgatc tcttgtgtat tctctactct ttgtgtgcct ctctttagtt 4500 
ttctctcctt ctctcttgct ctcccctgtt ctctctctat ctctctccct 4550 
ccctctttcc acctatctca ttctctttct aagctt 4586 
<210> 27 



15 
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<211> 1055 
<212> PRT 

<213> Homo sapiens 



I^Thr Glu Pro Ala Gly Pro Asp Leu Gly Thr Pro Phe Thr Ser 
i 5 10 

Arg Leu Pro Pro Glu Leu Gin Val Leu Lys Asp Glu Gin Ala His 



20 25 



Arg Leu Ala Pro Lys Gly Asn Gin Ser Arg lie Leu Thr Gin Ala 
m 40 * 3 

L0 " 

Tyr Lys Arg Met Ala Glu Glu Ala Met Gin Lys Lys His Gin Asn 
50 55 

Thr Gly Pro Ala Leu Glu Gin Glu Asp Lys Thr Ser Lys Val Ala 
65 70 

15 Pro Gly Thr Ala Pro Leu Pro Arg Leu Gly Ala Thr Pro Gin Glu 

80 85 

Ser Ser Leu Leu Ala Gly He Leu Ala Ser Glu Leu Lys Ser Ser 
95 100 

Tr P Ala Lys Ser Gly Thr Gly Glu Val Pro Ser Ala Pro Arg Glu 
20 110 115 

AS n Arg Thr Thr Pro Asp Cys Glu Arg Ala Phe Pro Glu Glu Arg 
125 130 

Pro Glu val Leu Gly Gin Arg Ser Thr Asp Val Val Asp Leu Glu 
140 145 

25 Asn Glu Glu Pro Asp Ser Asp Asn Glu Trp Gin His Leu Leu Glu 

155 160 

Thr Thr Glu Pro Val Pro lie Gin Leu Lys Ala Pro Leu Thr Leu 
170 175 

Leu Cys Asn Pro Asp Phe Cys Gin Arg He Gin Ser Gin Leu His 



30 



185 



190 



Glu Ala Gly Gly Gin He Leu Lys Gly lie Leu Glu Gly Ala Ser 



200 



205 



His lie Leu Pro Ala Phe Arg Val Leu Ser Ser Leu Leu Ser Ser 



215 



220 



35 Cys Ser Asp Ser Val Ala Leu Tyr Ser Phe Cys Arg Glu Ala Gly 

230 235 

Leu Pro Gly Leu Leu Leu Ser Leu Leu Arg His Ser Gin Glu Ser 
245 250 

Asn Ser Leu Gin Gin Gin Ser Trp Tyr Gly Thr Phe Leu Gin Asp 

-52- 



JSDOCID: <WO 9943828A1 J_> 



WO 99/43828 PCT/US99/04 1 1 2 

260 265 270 

Leu Met Ala Val lie Gin Ala Tyr Phe Ala Cys Thr Phe Asn Leu 
275 280 285 

Glu Arg Ser Gin Thr Ser Asp Ser Leu Gin Val Phe Gin Glu Ala 
5 290 295 300 

Ala Asn Leu Phe Leu Asp Leu Leu Gly Lys Leu Leu Ala Gin Pro 
305 310 315 

Asp Asp Ser Glu Gin Thr Leu Arg Arg Asp Ser Leu Met Cys Phe 
320 325 3 3 o 

0 Thr Val Leu Cys Glu Ala Met Asp Gly Asn Ser Arg Ala lie Ser 

335 340 345 

Lys Ala Phe Tyr Ser Ser Leu Leu Thr Thr Gin Gin Val Val Leu 
350 355 3 6 o 

Asp Gly Leu Leu His Gly Leu Thr Val Pro Gin Leu Pro Val His 
5 365 370 375 

Thr Pro Gin Gly Ala Pro Gin Val Ser Gin Pro Leu Arg Glu Gin 
380 385 390 

Ser Glu Asp lie Pro Gly Ala lie Ser Ser Ala Leu Ala Ala lie 
395 400 405 

Cys Thr Ala Pro Val Gly Leu Pro Asp Cys Trp Asp Ala Lys Glu 
410 415 420 

Gin Val Cys Trp His Leu Ala Asn Gin Leu Thr Glu Asp Ser Ser 
425 430 " 437 

Gin Leu Arg Pro Ser Leu He Ser Gly Leu Gin His Pro He Leu 
440 445 450 

Cys Leu His Leu Leu Lys Val Leu Tyr Ser Cys Cys Leu Val Ser 
455 460 * 465 

Glu Gly Leu Cys Arg Leu Leu Gly Gin Glu Pro Leu Ala Leu Glu 
470 475 4 8 o 

Ser Leu Phe Met Leu lie Gin Gly Lys Val Lys Val Val Asp Trp 
485 490 435 

Glu Glu Ser Thr Glu Val Thr Leu Tyr Phe Leu Ser Leu Leu Val 
500 505 510 

Phe Arg Leu Gin Asn Leu Pro Cys Gly Met Glu Lys Leu Gly Ser 
515 520 525 

Asp Val Ala Thr Leu. Phe. Thr His Ser His Val Val Ser Leu Val 
530 535' 540 

Ser Ala Ala Ala Cys Leu Leu Gly Gin Leu Gly Gin Gin Gly Val 
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545 

Thr Phe Asp Leu Gin Pro Met Glu Trp Met Ala Ala Ala Thr His 
560 565 

Ala Leu Ser Ala Pro Ala Glu Val Arg Leu Thr Pro Pro Gly Ser 
5 575 580 



Cys Gly Phe Tyr Asp Gly Leu Leu lie Leu Leu Leu Gin Leu Leu 
590 595 

Thr Glu Gin Gly Lys Ala Ser Leu He Arg Asp Met Ser Ser Ser 
605 610 

X0 Glu Met Tr P Thr Val Leu Trp His Arg Phe Ser Met Val Leu Arg 

620 62b 

Le u Pro Glu Glu Ala Ser Ala Gin Glu Gly Glu Leu Ser Leu Ser 
635 640 

Ser Pro Pro Ser Pro Glu Pro Asp Trp Thr Leu lie Ser Pro Gin 
15 650 655 

Gly Met Ala Ala Leu Leu Ser Leu Ala Met Ala Thr Phe Thr Gin 
665 670 

Gl » Pro Gin Leu Cys Leu Ser Cys Leu Ser Gin His Gly Ser lie 
680 68b 

20 Leu Met Ser He Leu Lys His Leu Leu Cys Pro Ser Phe Leu Asn 

695 700 

Gin Leu Arg Gin Ala Pro His Gly Ser Glu Phe Leu Pro Val Val 
710 715 

Val Leu Ser Val Cys Gin Leu Leu Cys Phe Pro Phe Ala Leu Asp 

795 7 ^ 

25 

lm ,p„ leu He Val Val Leu Ala Asp Leu Arg Asp 
Met Asp Ala Asp Leu Leu lie v^x « 75Q 

740 74b 

Ser Glu Val Ala Ala His Leu Leu Gin Val Cys Cys Tyr His Leu 
755 760 

„ i n„ Tmi Pro He Ser Leu Leu Thr Arg Leu 
30 Pro Leu Met Gin Val Glu Leu Pro i±e ^ 

770 7,b 

Ala Leu Met Asp Pro Thr Ser Leu Asn Gin Phe Val Asn Thr Val 
785 790 

Ser Ala Ser Pro Arg Thr He Val Ser Phe Leu Ser Val Ala Leu 
35 800 805 

Leu. Ser Asp Gin Pro Leu Leu Thr Ser Asp Leu Leu Ser Leu Leu 
815 820 

Ala Hi* Thr Ala Arg Val Leu Ser Pro Ser His Leu Ser Phe He 
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830 835 840 

Gin Glu Leu Leu Ala Gly Ser Asp Glu Ser Tyr Arg Pro Leu Arg 
845 850 855 

Ser Leu Leu Gly His Pro Glu Asn Ser Val Arg Ala His Thr Tyr 
860 865 8?0 

Arg Leu Leu Gly His Leu Leu Gin His Ser Met Ala Leu Arg Gly 
875 880 " 885 

Ala Leu Gin Ser Gin Ser Gly Leu Leu Ser Leu Leu Leu Leu Gly 
890 895 900 

Leu Gly Asp Lys Asp Pro Val Val Arg Cys Ser Ala Ser Phe Ala 
905 910 915 

Val Gly Asn Ala Ala Tyr Gin Ala Gly Pro Leu Gly Pro Ala Leu 



920 



925 



930 



Ala Ala Ala Val Pro Ser Met Thr Gin Leu Leu Gly Asp Pro Gin 
935 940 945 

Ala Gly He Arg Arg Asn Val Ala Ser Ala Leu Gly Asn Leu Gly 
950 . 955 960 

Pro Glu Gly Leu Gly Glu Glu Leu Leu Gin Cys Glu Val Pro Gin 
965 970 975 

Arg Leu Leu Glu Met Ala Cys Gly Asp Pro Gin Pro Asn Val Lys 
980 985 990 

Glu Ala Ala Leu lie Ala Leu Arg Ser Leu Gin Gin Glu Pro Gly 
995 1000 1Q05 

He His Gin Val Leu Val Ser Leu Gly Ala Ser Glu Lys Leu Ser 
1010 1015 1020 

Leu Leu Ser Leu Gly Asn Gin Ser Leu Pro His Ser Ser Pro Arg 
1025 1030 X o35 

Pro Ala Ser Ala Lys His Cys Arg Lys Leu He His Leu Leu Arg 
1040 1045 105Q 

Pro Ala His Ser Met 
1055 
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